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5 This Application claims priority from U.S. Provisional Patent Application No. 

60/194,419, filed on April 4. 2000. which is incorporated herein by reference in its entirety 
under 35 USC § 119(e). 



FIELD OF THE INVENTION 

10 The present invention relates to fragments of HCV NS3 RNA helicase, including 

mutants, homologues and co-complexes thereof, which are properly folded, soluble, 
monodisperse, and stable in buffered aqueous solutions at physiological pH (4-8). Helicase 
fragments of the invention maintain these properties at concentrations necessary to screen for 
and design specific inhibitors against HCV helicase using NMR, X-ray crystallographic and 

1 5 biological functional assay methods. 



BACKGROUND OF THE INVENTION 

The hepatitis C virus (HCV) causes one of the world's most pandemic and insidious 
diseases. According to the World Health Organization, there are approximately 170 million 

20 carriers worldwide with prevalence up to 0.5 - 10% [Release, Lancet^ 351:1415 (1998)]. In the 
United States, four million individuals are afflicted with hepatitis C [Alter and Mast, 
Gastroenterol Clin North Am , 23:437-455 (1994)], of which 75% to 85% will develop a 
chronic infection. This may ultimately lead to cirrhosis (10% to 20%) and hepatocellular 
carcinoma (1% to 5%) [Cohen, Science^ 285:26-30 (1999)]. The causative agent, HCV, was 

25 identified in 1989 and accounted for 50% to 60% of the non-A, non-B transfusion associated 
hepatitis [Alter et al.,NEnglJMed, 321 T494-1500 (1989); Choo et aL, Science^ 244:359-362 
(1989); Kuo et aL, Science, 244:362-364 (1989)]. More than 100 strains of the virus have been 
identified, and are grouped into six major genotypes which tend to cluster in different regions 
of the world [Simmonds, Current Studies in Hematology and Blood Transfusion, Reesink, ed., 

30 Karger, Basel, pp. 12-35 (1994); van Doom, J Med Vir^ 43:345-356 (1994)]. 

To date, interferon-alpha monotherapy and interferon-alpha-2b and ribavirin 
combination therapy (REBETRON 1 ^ ® (combination therapy containi ng REBETOL® 



< 
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(ribavirin. USP) capsule s and INTRON© A (interferon a l pha-2b« recombinan t) injection) 
Schering-Plough, Kenilworth, NJ) are the only approved treatments. However, in one study 
less than 10% of the patients responded to interferon-alpha monotherapy and 41% of the 
patients responded to REBETRON^^jcombination therapy containing REBETOL© 
5 (ribavirin. USP) capsules and INTRON® A (interferon alpha-2b - recombinant) injection) 
[Reichard et al, Lancet ^ 351:83-87 (1998)]. The most promising antiviral targets in chronic 
HCV infection are the replication enzymes, RNA-binding proteins, viral entry proteins and 
enzymes required for viral maturation. Therefore, it would be advantageous if those skilled in 
the art had the means to develop more effective antiviral agents against the various viral targets 

10 to effectively combat this disease. 

HCV is a member of the Flaviviridae family. It is a positive-sense, single-stranded 
RNA virus with genome size of approximately 9.4 kb [Heinz, Arch Viral Supp± 4:163-171 
(1992); Mizokami and Ohba, GastroenterolJPN, 28 Supp 5:42-44 (1993); Ohba et al, FEBS 
Lett^ 378:232-234 (1996); Takamizawa e/a/., /Wro/^ 65:1 105-1 113 (1991)]. HCV genomic 

1 5 RNA encodes a polyprotein of approximately 3000 amino acid residues: NH2-C- 

El-E2-p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B-cooH [Lohmann et al, J Hepatol ± 24:1 1-19 
(1996); Simmonds, Clin Ther h 18 SuppL S upp B:9-36 (1996)]. The polyprotein undergoes 
subsequent proteolysis by host and viral enzymes to yield mature viral proteins [Grakoui et al, 
J Viroh 67:1385-1395 (1993); Shimotohno et al, J Hepatol, 22:87-92(1995)]. 

20 The NS3 protein has been the target of interest for antiviral discovery because of its 

important roles in HCV maturation and replication. There are two major functional domains: 
the amino-terminal one third of the protein is a serine protease responsible for certain key 
aspects of polyprotein processing [Shimotohno et ai 9 J Hepatol h 22:87-92 (1995)], and the 
carboxy-terminal two thirds shares sequence similarity with the DEAD box family of RNA 

25 helicases [Gorbalenya et al, FEBS Lett, 235:16-24 (1988); Koonin and Dolja, CritRev 
Biochem Moh 28:375-430 (1993); Korolev et al, Protein Science, 7:605-610 (1998)]. 

RNA helicases are grouped into two major superfamilies (SFI and SFII) on the basis of 
the occurrence of seven conserved motifs, a smaller superfamily (SFIII), and two smaller 
families [Gorbalenya and Koonin, Curr Opin Struct Biol, 3:419-429 (1993)]. RNA helicases 

30 are mostly of the SFII superfamily and can be further classified into families on the basis of 
particular consensus sequences in the conserved motifs [de la Cruz et al, TIBS, 24:192-198 



(1999)]. The HCV NS3 RNA helicase is classified as a DExH protein of the SFII superfamily. 
HCV helicase has two enzymatic activities: NTPase, which is believed to provide an energy 
source for the unwinding reaction through NTP hydrolysis, and nucleic acid unwinding [Kim 
et aL, Virus Res, 49:17-25 (1997); Suzich et alj Virol 67:6152-6158 (1993)]. As such, HCV 
5 RNA helicase is essential for replication and production of infectious virions, which makes it 
an excellent target for therapeutics [Kadare and Haenni, J Virol, 71:2583-2590 (1997)]. 
Studies of the crystal structure of HCV helicase reveal that it has three subdomains: subdomain 
I, which contains NTP and Mg* 4 " binding sites; subdomain II, which is believed to contain a 
nucleic acid binding site; and subdomain III, which has an extensive helical structure. A 

10 coupling region lies between subdomains I and II, and is believed to be involved in 

transforming chemical energy into motion associated with unwinding [Kim et aL, Structure, 
156:89-100 (1998); Cho et aL, JBC , 273:15045-15052 (1998); Yao et aL, Nat Struct Biol, 
4:463-467 (1997)]. The functions of some of these motifs have been elucidated by studies of 
the effects of mutations on NTP and RNA binding, NTP hydrolysis and unwinding activity 

15 [Pause and Sonenberg, Curr Opin Struct Biol, 3:953-959 (1993)]. Recently, the basic 

mechanism for RNA duplex unwinding by the DExH RNA helicase NPH-II was described 
[Jankowsky et aL, Nature^ 403:447-451 (2000)], however, in almost all cases the precise 
mechanism and the substrates of these enzymes have not been defined. Therefore, it would be 
beneficial to those skilled in the art to have suitable fragments of the HCV NS3 helicase which 

20 could be used to provide such valuable information and simplify the development of specific 
inhibitors for this enzyme. Nevertheless, there has been no report of an HCV helicase 
subdomain or fragment that is suitable for this purpose. 

To better study the enzymatic properties of the HCV NS3 helicase {e.g., NTP binding, 
single and double stranded nucleic acid binding sites, energy coupling and helicase activity) 

25 and develop potential inhibitors against this enzyme, it is desirable to have suitable fragments 
of the protein for use in methods or techniques such as nuclear magnetic resonance (NMR) 
spectroscopy and X-ray crystallography. For example, recent developments in NMR-based 
drug discovery methods provide a powerful means for identifying and optimizing non-peptide 
drug-like leads, however, such methods are currently limited to proteins having a size of about 

30 less than 30 kDa [Shuker et aL, Science, 274:1531-1534 (1996)] and smaller helicase 

fragments have not been previously reported. The 451 residue HCV NS3 helicase, which is 



about 48.2 kDa, is simply too large for effective use in such methods. Furthermore, to be 
useful, a fragment should be folded correctly, soluble, monodisperse, and stable in a buffered 
aqueous solution close to physiological conditions (pH 4-8 and salt concentrations less than 
about 250 mM). Therefore, it would be advantageous to have fragments of HCV NS3 helicase 
5 that are suitable for the most advanced techniques for characterizing proteins and designing 
inhibitors such as NMR, X-ray crystallography and ATPase assays such as the continuous 
spectrometric assay [Pullman et al.,JBiol Chem^ 235:3322-3329 (I960)]. In addition, such 
fragments should be suitable for probing NTP and nucleic acid binding sites of the HCV NS3 
helicase by NMR and crystallography, which together with mechanistic studies will provide 
10 insights into the mode of unwinding for HCV helicase. 

SUMMARY OF THE INVENTION 

The present invention provides novel fragments of HCV NS3 helicase based on the 
three subdomains I, II, and III. The fragments are properly folded, soluble at millimolar 

15 concentrations, monodisperse, and stable in buffered aqueous solutions under physiological 
conditions (pH 4-8). In addition, the fragments are small (less than about 30 kDa), making 
them useful for NMR-based drug discovery techniques (compared to the full length enzyme 
which is too large for this purpose). The solubility and stability of HCV helicase fragments of 
this invention are easily optimized, as needed, by varying solution conditions and/or by 

20 introducing additional specific mutations into the fragment, as described. The properties of an 
HCV NS3 helicase fragment of the invention allows it to be expressed at high levels in 
conventional expressions systems, such as E. coli, to permit efficient, large-scale production, 
e.g., as [ N]-labeled polypeptide for NMR-based screening applications and production of 
[ 2 H, 13 C, 15 N]- or [ 13 C, 15 N] -labeled polypeptide for structural NMR studies. Thus, the 

25 properties of a fragment makes it useful in the most advanced NMR techniques available, e.g., 
novel NMR-based drug discovery techniques such as SAR-by-NMR [see, e.g., Shuker et al, 
Science^ 274: 1 53 1 -1 534 (1 996) and U.S. Patent No. 5,989,827], in biological functional assays 
to discover inhibitors of HCV NS3 helicase, and to evaluate the mechanism of action and 
substrates for HCV NS3 helicase. 

30 The invention further relates to HCV NS3 helicase fragments in crystalline form, and 

to conditions for crystallizing the same. A crystalline helicase fragment of the invention is 



useful in X-ray crystallography to identify non-peptide drug-like small molecule inhibitors of 
HCV NS3 helicase based on the crystalline structure (including homologues, mutants, and 
co-complexes of crystalline fragments). By detecting the interactions between an inhibitor and 
cyrstalline helicase fragment, the activity of such inhibitors can be further optimized. 
5 Helicase fragments and crystals of this invention are also useful for probing NTP and 

nucleic acid binding sites of HCV NS3 helicase using NMR spectroscopy and X-ray 
crystallography techniques, which, together with mechanistic studies, provide insight into the 
mode of unwinding for HCV helicase. 

Helicase fragments and crystals of this invention also provide methods for determining 

10 the three-dimensional structure (coordinates and atomic details) of such helicase fragments, or 
mutants, homologues or co-complexes thereof, in order to design, computationally evaluate, 
synthesize and use inhibitors of HCV NS3 helicase which may prevent or treat the undesirable 
physical and pharmacological properties of HCV. 

Thus, in one embodiment, the invention provides fragments of HCV NS3 protein 

15 which are derived from amino acids 181 to 324; from amino acids 327 to 481, wherein the 
amino acid residues at positions 431 to 451 are deleted and replaced by the amino acid 
sequence SDG K (SEP ID NO: 2) ; from amino acids 181 to 481, wherein the amino acid 
residues at positions 43 1 to 45 1 are deleted and replaced by amino acids SDG K (SEQ ID NO: 
2Q; and from amino acids 181 to 572, wherein the amino acid residues at positions 328 to 482 

20 are deleted. 

In another embodiment, the invention provides buffered solutions, which contain from 
50 to 1000 jiM of a helicase fragment, from 5 to 15% weight to volume of D2O, a protease 
inhibitor, 25 to 250 mM KPO4, and 1 to 10 mM DTT, wherein the pH of the solution is from 
about 4 to 8. 

25 The invention further provides precipitant solutions which contain from 1 to 60 jag of a 

helicase fragment, from 5 to 40% weight to volume of a precipitant compound, from 1 to 1000 
mM of a salt, and a buffer for a precipitant solution, wherein the pH of the solution is from 
about 4 to 8 and the temperature is from about 1 to 26°C. 

In still another embodiment, the invention provides methods for identifying inhibitor 

30 compounds of HCV helicase protein, which may include obtaining a helicase polypeptide 

fragment, which comprises a subdomain I or subdomain II, and contacting the fragment with a 
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potential inhibitor compound; assaying the fragment in contact with the inhibitor compound 
and the HCV helicase protein for activity based on the subdomain; and comparing the activity 
of the fragment in contact with the compound to the activity of the HCV helicase protein, such 
that a decrease in the activity of the fragment compared with the HCV helicase protein 
5 identifies the compound as an inhibitor of HCV helicase activity. Alternatively, the activity of 
a helicase fragment, which is not in contact with an inhibitor compound, can be compared with 
the activity of a fragment in contact with the compound, instead of a full-length helicase 
protein. 

These and other embodiments of the invention will be appreciated by considering the 
10 following detailed description of the invention and the accompanying Examples. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A depicts a ribbon diagram of an HCV NS3 RNA helicase. Subdomains I, II, 
and III are shown in white, black, and gray, respectively. 
1 5 Figure IB depicts a ribbon diagram of an HCV NS3 helicase subdomain I construct 

(SEQ ID NO: 3) containing residues 181-324 from HCV-la NS3 helicase. 

Figure 1C depicts a ribbon diagram of an engineered HCV NS3 helicase subdomain II 
construct (SEQ ID NO: 4) containing residues 327-481 from HCV-laNS3 helicase, in which 
residues 431-451 are replaced by the tetra-peptide insertion SDGK (SEQ ID NO: 2) at residue 
20 431. 

Figure ID depicts a ribbon diagram of an engineered HCV NS3 helicase subdomain 
1,11 construct (SEQ ID NO: 5) containing residues 181-481 from HCV-la NS3, in which 
residues 431-451 are replaced by the tetra-peptide insertion SDGK (SEQ ED NO: 2) at residue 
431 . Subdomains I and II are shown in white and black, respectively. 

25 Figure IE depicts a ribbon diagram of an engineered HCV NS3 helicase subdomain 

I,III construct (SEQ ID NO: 6) containing residues 1 8 1 -572 from HCV- 1 a NS3, in which 
residues 328-482 are deleted. Subdomains I and III are shown in white and gray, respectively. 

Figure 2 depicts a two dimensional (2D) 15 N-HSQC NMR spectrum of 200 (iM HCV 
NS3 helicase subdomain 1,11 construct (SEQ ID NO: 5). 

30 Figure 3 depicts a chemical shift index (CSI) for an engineered HCV NS3 helicase 

subdomain II construct (SEQ ID NO: 4). 
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DETAILED DESCRIPTION OF THE INVENTION 

All references cited herein are hereby incorporated by reference in their entireties. 

Molecular Biological Techniques and Definitions 
5 In accordance with the present invention, there may be employed conventional 

molecular biology, microbiology, or recombinant DNA techniques within the ordinary skill of 
the art to prepare viral constructs and helicase fragments of the invention. Such techniques are 
explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: 
A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold 
10 Spring Harbor, New York (herein "Sambrook et al, 1989"); DNA Cloning: A Practical 

Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis [M.J. Gait ed. 
(1984)]; Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; Transcription 
And Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; Animal Cell Culture [R.I. Freshney, 
ed. (1986)]; Immobilized Cells And Enzymes [TRL Press, (1986)]; A Practical Guide To 
15 Molecular Cloning [B. Perbal (1984)]; Current Protocols in Molecular Biology, John Wiley & 
Sons, Inc. [F.M. Ausubel et al (eds.) (1994)]; [Burleson, Virology: A Laboratory Manual, 
Academic Press, New York (1992)]. 

As used herein, the abbreviations "nt" and "aa" refer to "nucleotide(s)" and "amino 
acid(s)", respectively. 

20 A "nucleic acid molecule" refers to the phosphate diester polymeric form of 

ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules"), or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or deoxycytidine; 
"DNA molecules"), in either a single stranded or a double stranded form. Double stranded 
DNA-DNA, DNA-RNA and RNA-RNA helices are contemplated. The term nucleic acid 

25 molecule, and in particular DNA or RNA molecule, refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. The structure of 
a particular nucleic acid molecule, sequence or region may be described herein according to 
the normal convention of providing a sequence in the 5' to 3' direction. A "recombinant" DNA 
molecule has undergone a molecular biological manipulation. 

30 The term "gene" means a DNA sequence that encodes or corresponds to a particular 

sequence of amino acids, which comprise all or a portion of one or more proteins or enzymes. 
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Preferably, if a gene encodes only a portion or fragment of a protein or enzyme, then it encodes 
a functional portion (e.g., a subdomain) that has an activity present in the full length protein or 
enzyme. For example, a viral gene encoding an HCV NS3 helicase may encode the entire 
helicase domain or it may encode a fragment thereof. 

A "subdomain' 1 refers to a segment of amino acids of a protein or polypeptide that has a 
particular property, e.g. , nucleic acid unwinding activity, NTPase activity or ATP binding or 
catalytic activity. "Subdomain I" refers to a fragment of HCV NS3 which corresponds to aa 
181 to 327; "subdomain II" refers to a fragment of HCV NS3 which corresponds to aa 328 to 
483; "subdomain III" refers to a fragment of HCV NS3 which corresponds to aa 484 to 631. 

A "fragment" refers to a segment of amino acids derived from an HCV NS3 helicase 
protein. A fragment preferably includes a subdomain or a fragment thereof, but may also 
comprise an entire domain. An HCV helicase fragment of the present invention has a 
molecular mass (size) between about 5 and 30 kDa, which can be assessed using conventional 
techniques in the art, e.g. , SDS PAGE. The smaller size allows effective use with the most 
advanced NMR methods. Modification(s) to fragments of the present invention (e.g., variants) 
are contemplated and described in greater detail below. 

"Monodisperse" and "predominantly uniform molecular species", in reference to an 
HCV helicase fragment of the present invention, can be used interchangeably to indicate that 
the mean radius of particles comprising the HCV helicase fragment varies by less than 30%, 
preferably less than 15 %, as determined by, e.g., conventional dynamic light scattering 
methods. A monodisperse helicase fragment in solution preferably exists in a monomelic form, 
however, oligomers (e.g., dimers, trimers tetramers, etc.) may exist too. Such oligomeric 
forms of a helicase fragment preferably have a molecular weight of less than about 30 kDa. 

As used herein, "helicase fragment" or "helicase protein", refer to a polypeptide 
derived from an HCVNS3 gene, e.g., HCV-laNS3 [Rice, in Fields Virology, 3 rd ed. (B.N. 
Fields et al, eds., Raven, New York) p. 615 (1996); SEQ ID NO: 1], which polypeptide 
exhibits one or more properties of HCV NS3 helicase activity. A helicase fragment preferably 
lacks any portion of HCV NS3 that exhibits protease activity, e.g., that portion of the HCV 
NS3 protease located in aa 1 to 180 of SEQ ID NO: 1. A helicase fragment of this invention is 
(1) structurally sound (i.e., it folds properly in comparison with a full length HCV NS3 
helicase protein based on NMR or crystallography studies), (2) soluble (i.e., it folds properly 



upon expression such that the polypeptide fragment does not form inclusion bodies, aggregate, 
or require the use of a solvent or other reagent to induce the proper folding of the enzyme in 
comparison with the full length helicase protein), (3) stable in a buffered solution (e.g., the 
protein maintains a conformation that is properly folded in buffered solutions, which can be 
used for NMR or crystallography applications, in comparison with the full length HCV 
helicase protein, for a period of time needed to perform the NMR or x-ray crystallography 
study, typically for at least two weeks), and (4) monodisperse (i.e., it exists as a predominantly 
uniform molecular species in solution where the size of the uniform molecular species is 
suitable for NMR and x-ray crystallography studies). 

A "sequence-conservative variant" of a gene contains a change of one or more 
nucleotides in a given codon position, which results in no alteration in the amino acid encoded 
at that position. A "function-conservative variant" contains a change to one or more 
nucleotides which causes an alteration in an amino acid residue in the protein or enzyme, 
including, but not limited to replacement of an amino acid for another having similar properties 
(such as, for example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, 
aromatic, and the like). The resulting amino acid in a function-conservative variant does not 
alter the overall conformation or function of the polypeptide. 

A "coding sequence" or a sequence "encoding" an expression product, e.g., RNA, 
polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in the 
production of that expression product. A coding sequence is "under the control" or 
"operatively associated with" transcriptional and translational control sequences in a cell when 
an RNA polymerase transcribes the coding sequence into mRNA, which can then be translated 
into a protein encoded by the coding sequence. 

The terms "express" and "expression" mean allowing or causing the information in a 
gene or DNA sequence to become manifest, e.g., producing a protein by activating the cellular 
functions involved in transcription and translation of a corresponding gene or DNA sequence. 
A DNA sequence can be expressed using in vitro translation assays or in or by a cell to form an 
"expression product" such as a mRNA or a protein. The expression product, e.g. the resulting 
protein, may also be referred to as "expressed". 
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Subdomains ofHCVNS3 helicase 
The 45 1 residue helicase of HCV NS3 has three nearly equal-sized subdomains, which 
form a triangular-shaped molecule approximately 65 A on a side and 35 A thick (see Fig. 1 A). 
5' Subdomains I and III share a more extensive interface together than either shares with 

subdomain II. Therefore, the amino- and carboxy-terminal subdomains are closely packed and 
form a rigid unit, whereas the second subdomain is flexibly linked to the remainder of the 
structure and can rotate as a rigid body. 

In one embodiment of the invention, fragments are based on subdomain I of HCV NS3 

1 0 helicase, i. e. , the "NTPase subdomain" which includes NTP -binding residues (aa 1 8 1 to 327 of 
SEQ ID NO: 1) within the nucleotide binding fold shared by other NTPases (Figure IB). Thus, 
fragments of subdomain I can be prepared, e.g., from aa 181 to 324 of SEQ ED NO: 1, using 
conventional molecular biology cloning techniques. A fragment is exemplified by SEQ ID 
NO: 3, however variants are contemplated as described in greater detail below, e.g., a 

15 polypeptide fragment of aa 190 to 327 of SEQ ID NO: 1 would be functional. Assays for 
determining NTPase activity of helicases are well known in the art, and such assays are 
contemplated for determining activity of a helicase fragment of subdomain I in the absence or 
presence of inhibitor compounds [see, e.g., Howe et al, Protein Science^ 8:1332-1341(1999) 
for discussion of helicase protein activity assays]. 

20 Higher concentrations of fragments derived from subdomain I (about 1 mM) can be 

prepared without aggregation of polypeptides by making additional substitutions within this 
subdomain. Such an improvement in the solubility of fragments derived from subdomain I, at 
higher protein concentrations, is beneficial for NMR-based drug discovery techniques (see, 
e.g. , U.S. Patent No. 5,989,827), since aggregation will interfere with NMR techniques. Thus, 

25 to further improve the solubility and maintain the desirable monomelic state of fragments of 
the present invention, fragments derived from HCV helicase subdomain I that are desirable at 
higher concentrations can be prepared with an amino acid substitution at either aspartic acid 
(Asp) 249 or arginine (Arg) 257. Substitutions to both residues in the same polypeptide 
fragment will likely not improve the solubility of the fragment. Amino acid residues that can 

30 be employed for substitution include nonpolar amino acids, e.g., alanine, valine, leucine, 

isoleucine, and phenylalanine. The preferred substitutions for aspartic acid 249 include lysine 



and arginine. The preferred substitutions for arginine 257 include glutamic acid and aspartic 
acid. 

Fragments of the invention are also based on subdomain II of HCV NS3 helicase, i.e., 
the U RNA binding subdomain" which includes an Arg-rich (aa 460 to 468) sequence that is 
required for RNA unwinding. Fragments of subdomain II can be prepared, e.g. , from aa 327 to 
481 of SEQ ID NO: 1, using conventional molecular biology cloning techniques. A fragment 
based on subdomain II is exemplified by SEQ ID NO: 4 {see also Fig. 1C), however variants 
are contemplated as described in greater detail below, e.g., a polypeptide fragment of aa 327 to 
489 of SEQ ID NO: 1 would also be functional. Assays for determining RNA binding kinetics 
of helicases are well known in the art, and such assays are contemplated for determining 
activity of a helicase fragment of subdomain II in the absence or presence of inhibitor 
compounds [see, e.g., Howe et al, Protein Sciences, 8:1332-1341(1999) for discussion of 
helicase protein activity assays]. 

Fragments of the invention comprising subdomain II can be improved, e.g. , have better 
folding properties, by reducing the size of an antiparallel (3-loop at aa 43 1 to 45 1 . Hydrophobic 
patches of residues may contribute to formation of inclusion bodies or to aggregation of 
polypeptides containing this subdomain. Using the sequence of SEQ ID NO: 1 as an example, 
it is preferred that at least residue 438 is removed from a fragment including subdomain II, 
more preferably residues 431 to 451 are removed. Shorter or longer deletions 
carboxy-terminal or amino-terminal to residues 431 to 451 are permissible as needed, but in 
any case should not exceed residues 430 to 452. Engineered loops can be constructed by 
deleting residues in the amino-terminal portion of subdomain II (e.g., aa 430-438), and in the 
carboxy-terminal portion (e.g., aa 444-452) followed by insertion of linkers containing two to 
six residues. An insertion to replace the antiparallel P-loop of subdomain II can significantly 
improve the solubility and stability of an HCV helicase fragment of the invention containing 
this subdomain. It is preferred that amino acid sequence SDGK (SEQ ID NO: 2) is inserted for 
deletions of the antiparallel p-loop. Other possible insertions include QGGA (SEQ ED NO: 7), 
RGST (SEQ ID NO: 8), RGPG (SEQ ID NO: 9), SKGE (SEQ ID NO: 10), EQGA (SEQ ID 
NO: 1 1), RNNQ (SEQ ID NO: 12), ADGS (SEQ ID NO: 13), and CDGL (SEQ ID NO: 14). 
Examples of these fragments are provided by SEQ ID NOS: 4 and 5. 
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Fragments can also be prepared from subdomains III of HCV NS3 helicase, i.e., the 
"a-helical subdomain", which can be derived from aa 484 to 63 1 of SEQ ID NO: 1 . 

Fragments, which include combinations of segments from different subdomains of 
HCV NS3 helicase, are also contemplated. For example, fragments of the invention can be 
5 prepared from subdomains I and II, I and III, or II and III. In a specific embodiment described 
in the Examples below, fragments are prepared from subdomains I and II {e.g., SEQ ID NO: 5; 
Fig. ID) and subdomains I and III {e.g., SEQ ED NO: 6; Fig. IE). 

Fragments of the invention which are based on subdomains I and II can be prepared, 
e.g., from aa 181 to 483 of HCV-la NS3, using conventional molecular biological techniques 
10 and the rationale set forth above for preparing fragments based on subdomains I and II 
individually. 

Inspection of the three-dimensional structure of the HCV NS3 helicase reveals that the 
carboxy-terminus of subdomain I and the amino-terminus of subdomain III are very close to 
each other (approximately 4 A) in tertiary structure. Subdomains I and III can be linked by 

15 removing all residues of subdomain II, e.g., aa 328 to 482 of HCV-la NS3. Removal of 

subdomain II reduces the molecular weight of a full-length helicase domain by about 15 kDa. 
To further reduce the molecular weight of fragments that include subdomains I and III, up to an 
additional 59 residues can be deleted from the carboxy-terminus of subdomain III (aa 573 to 
63 1) based on a deletion mutation study. In fragments containing subdomain I and truncated 

20 subdomain III, both a specific nucleic acid binding pocket and the ATP binding pocket are 
preserved {see Fig. IE). 

Variants of HCV helicase fragments 
A polynucleotide encoding a helicase fragment of the present invention can differ in 
nucleotide sequence from another reference polynucleotide encoding the same fragment, e.g. , 

25 helicase from HCV-1 a versus helicase from HCV-lb. A change in the nucleotide sequence of 
the variant may be silent, i.e., it may not alter an amino acid encoded by the polynucleotide. 
Where an alteration is limited to a silent change of this type a variant will encode a polypeptide 
with the same amino acid sequence as the reference polypeptide {i.e., sequence-conservative 
variant). Changes in the nucleotide sequence of the variant may alter the amino acid sequence 

30 of the polypeptide encoded by the reference polynucleotide in its nucleotide or amino acid 
sequence, as described below. Thus, an HCV NS3 helicase polypeptide fragment of the 
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invention can differ in amino acid sequence from another reference HCV NS3 helicase 
polypeptide fragment of the invention, i.e., a variant. A "variant" can be a polynucleotide or 
polypeptide that differs from a reference polynucleotide or polypeptide, respectively. Such 
variants of a helicase fragment of the invention, as described herein, are contemplated for use 
in various assays and biological techniques such as NMR and crystallography in a similar 
manner as described for the non-variant helicase fragment. 

As used herein, the "reference" polynucleotide or protein is derived from HCV- la for 
purposes of example. Since fragments of the invention are derived from strains/isolates of 
HCV, differences in amino acid sequences of NS3 helicase are limited so that the sequences of 
the reference and the variant are closely similar overall and identical in many regions. A 
variant and reference polypeptide may differ in amino acid sequence by one or more mutations, 
substitutions, additions, deletions, truncations (deletion of residues from the amino-terminus, 
carboxy-terminus or both), fusion proteins or synthetic changes, e.g., pegylation. Such 
modifications, which may be present in any combination, are well known in the art and 
discussed in greater detail below. 

A variant may have (i) one or more amino acid residues substituted with a conserved or 
non-conserved amino acid residue (preferably a conserved amino acid residue, e.g. , Gly/Ala, 
Asp/Glu, Val/Ile/Leu, Lys/Arg, Asn/Gln and Phe/Trp/Tyr) and such substituted amino acid 
residue may or may not be encoded by the genetic code, or (ii) one or more amino acid residues 
that includes a substituent group resulting in a natural or non-naturally occurring amino acid, 
e.g., aliphatic esters or amides of the carboxy-terminus or of residues containing carboxyl side 
chains, O-acyl derivatives of hydroxyl group-containing residues, and N-acyl derivatives of 
the amino-terminal amino acid or amino-group containing residues (e.g. lysine or arginine), 
phosphorylated amino acid residues (e.g. , phosphotyrosine, phosphoserine or 
phosphothreonine), sulfonation, biotinylation, or (iii) a mature polypeptide that is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) additional amino acids not derived from HCV helicase 
fused to the mature polypeptide (i.e., & fusion protein), such as a leader or secretory sequence 
or a sequence which is employed for purification of the mature polypeptide or a pro-protein 
sequence. A fragment of the invention may further have combinations of these modifications. 
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If a fusion protein is desirable, the HCV NS3 helicase fragment can be either at the 
amino or carboxy termini of the fusion protein. Suitable functional enzyme fragments are 
polypeptides that exhibit a quantifiable activity when expressed fused to the HCV NS3 
helicase fragment. Exemplary enzymes include, without limitation, P-galactosidase ([3-gal), 
5 (3-lactamase, horseradish peroxidase (HRP), glucose oxidase (GO), human superoxide 

dismutase (hSOD), urease, and the like. These enzymes are convenient because the amount of 
fusion protein produced can be quantified by means of simple colorimetric assays. 
Alternatively, one may employ fragments or antigenic proteins, to permit simple detection by 
metal-binding columns and quantification of fusion proteins using antibodies specific for the 

10 fusion partner. A histidine tag of six histidine residues at a terminus of a fragment of the 
invention, preferably at the amino terminus, allows easy purification of fragments using 
methods well known in the art. 

Still other modifications can be prepared by the use of agents known in the art for their 
usefulness in cross-linking proteins through reactive side groups. Preferred derivatization sites 

15 with cross-linking agents are free amino groups, carbohydrate moieties and cysteine residues. 

Preparation of helicase fragments 
Various pathogenic and attenuated strains of HCV are known in the art [see Lohmann 
et aL, J Hepatol , 24:1 1-19 (1996); Rice, in Fields Virology, 3 rd ed. (1996), B.N. Fields et aL, 
eds., Raven, New York, p. 615], and can be used to prepare a helicase fragment of the 

20 invention. Fragments can be prepared from a purified, naturally occurring form of an HCV 
NS3 or a recombinant form having a natural or engineered modification, e.g., substitution, 
deletion, insertion, inversion, or other change resulting in a variant, that may change a 
characteristic of HCV NS3 helicase fragment or have no observable effect. One or more 
amino acid changes to an HCV NS3 helicase fragment of the invention that results in a 

25 sequence- or function-conservative variant is contemplated by the invention. 

Helicase fragments can also be prepared synthetically, based on the sequences 
disclosed herein (e.g., aa 181-631 of SEQ ID NO: 1, which sets forth the helicase domain of 
HCV- la) using a variety of techniques well known in the art, e.g., chemical synthesis, 
site-directed mutagenesis [Gillman et aL, Gene^ 8: 81 (1979); Roberts et aL, Nature, 328:731 

30 (1987); Innis, in PCR Protocols: A Guide to Methods and Applications, Academic Press, New 
York, NY (1990)], polymerase chain reaction methods, automated oligonucleotide synthesis 
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[e.g., see Warner, DNA^ 3: 401 (1984)], and polypeptide synthesis (Atherton et al, in Solid 
Phase Peptide Synthesis: A Practical Approach, 1989, IRL Press, Oxford). Adding epitope 
tags for purification or detection of recombinant products is also contemplated. In a particular 
embodiment, described infra, a His tag is used in the preparation of fragments of the invention. 

Conventional molecular biology and virology techniques can be used to obtain HCV 
strains/isolates, e.g., from a partial genomic sequence of a known strain, e.g., HCV- la or 
HCV-lb [Rice, CM., in Fields Virology, 3 rd ed. (B.N. Fields et al, eds.), p. 615 (1996)]. A 
nucleic acid encoding HCV NS3 helicase domain can be prepared from any available 
strain/isolate of HCV and a fragment generated therefrom. To facilitate the teaching of the 
invention, fragments are described using the amino acid sequence of HCV NS3 derived from 
HCV-1 a strain (SEQ ID NO: 1) by way of example. It shall be appreciated that other strains of 
HCV, which may have a helicase domain that is not identical to HCV-1 a {e.g., a helicase 
variant) can be used to prepare fragments of the invention. 

Expression systems 

Various conventional expression systems can be employed to express an HCV NS3 
helicase fragment of the invention, including prokaryotic {e.g., bacterial), eukaryotic {e.g., 
mammalian, yeast, and insect), and cell-free in vitro systems, which are commonly known in 
the art. To prepare HCV NS3 helicase fragments, conventional molecular biology techniques 
can be used to subclone HCV NS3 helicase polynucleotide encoding a fragment into a suitable 
expression vector, which is transformed into a suitable host and the fragment coding sequence 
expressed. For detailed methodologies see e.g., Sambrook supra. Preparation of helicase 
fragments for use in expression vectors is described in specific embodiments set forth in the 
Examples, infra. It is noted that the present invention is not limited to use of any particular 
vector or methodology described in the Examples below, which are provided for purposes of 
further illustrating the invention. 

Both prokaryotic and eukaryotic host cells can be used to express a desired HCV NS3 
helicase coding sequence when appropriate control sequences compatible with the selected 
host are used. Among prokaryotic hosts, E. coli is advantageous and preferred for expression 
of HCV NS3 helicase fragments because bacteria are easier to manipulate and higher 
quantities of protein expression can be achieved than with other available expression systems. 
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Cloning and expression vectors suitable for the needs of the practitioner can be selected 
from various, commercially available vectors that are compatible with prokaryotic hosts, e.g., 
pBR322, pUC, pET, and which also contain marker sequences conferring antibiotic resistance. 
The foregoing systems are particularly compatible with E. coli. Other prokaryotic hosts, e.g., 
5 strains of Bacillus and Pseudomonas, can be used with compatible control sequences known to 
those of ordinary skill in the art. In specific embodiments, described infra, the vector 
pET28b(+) (Novagen, Madison, Wisconsin) is used to express HCV NS3 helicase fragments. 
It is noted that due to a subcloning artifact from pET28b(+), these constructs have a G-S-H-M 
polypeptide sequence at the amino-terminus. Numerous expression control sequences are 

10 available for prokaryotes, including promoters, optionally containing operator portions, and 
ribosome binding sites, e.g., T7 bacteriophage promoter [Dunn and Studier, JMol Biol^ 166: 
477 (1983)], p-lactamase (penicillinase) and lactose promoter systems [Chang et al, Nature^ 
198: 1056 (1977)], tryptophan (tip) promoter system [Goeddel et al, Nuc Acids Res^ 8: 4057 
(1980)], A,-derived ?i promoter and N gene ribosome binding site [Shimatake et al, Nature^ 

15 292: 128 (1981)] and hybrid tac promoter [De Boer et al.,Proc Nat Acad Sci USA^ 292:128 
(1983)]. 

Eukaryotic hosts can be used as desired, including without limitation, yeast {e.g. , 
Saccharomyces, Klebsiella, Picia, and the like) and mammalian cells in culture systems. 
Yeast-compatible vectors and control sequences are well known in the art and can carry 

20 markers that permit selection of successful transformants by conferring prototrophy to 

auxotrophic mutants or resistance to heavy metals on wild-type strains. Mammalian cell lines 
available as hosts for expression are known in the art and include many immortalized cell lines 
available from the American Type Culture Collection (ATCC), including HeLa cells, Chinese 
hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell lines. 

25 Suitable promoters for mammalian cells are also known in the art, and include viral promoters 
from, e.g., Simian Virus 40 (SV40) [Fiers et al,Nature t 273: 1 13 (1978)], Rous sarcoma virus 
(RSV), adenovirus (ADV), and bovine papilloma virus (BPV), glyceraldehyde-3 phosphate 
dehydrogenase (GAPDH) promoter or alcohol dehydrogenase (ADH) regulatable promoter, 
terminators also derived from GAPDH, and if secretion is desired, a leader sequence derived 

30 from yeast a-factor (see U.S. Pat. No. 4,870,008). Mammalian cells may also require or 

benefit from terminator sequences, e.g., derived from the enolase gene [Holland, J Biol Chem & 
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256: 1385 (1981)], and poly-A addition sequences, enhancer sequences (which increase 
expression), sequences which promote gene amplification, e.g., methotrexate resistance genes, 
which are known in the art. 

Transformation of a host cell with a vector containing a polynucleotide sequence 
5 encoding an HCV NS3 helicase subdomain is accomplished using known methods in the art 
for introducing nucleic acid into cells, and will typically depend upon the host to be 
transformed, [see, e.g., Cohen, Proc Nat Acad Sci USA* 69: 21 10 (1972); Hinnen et al, Proc 
Nat Acad Sci USA, 75: 1929 (1978); Graham and Van der Eb> Virol, 52: 546 (1978)]. 

Isolation and purification of expressed HCV NS3 helicase fragments 

10 After expression of an HCV NS3 helicase fragment, HCV NS3 helicase polypeptide 

fragments can be isolated and purified according to conventional methods in the art, typically 
depending upon the type of expression system used. In specific embodiments, illustrated infra, 
HCV NS3 helicase fragments are expressed from pET28b(+) in E. coli and isolated by lysing 
cells and centrifuging to obtain the supernatant which contains the HCV NS3 helicase 

15 fragment. The supernatant is subjected to Ni 2+ chelation chromatography to purify the 

fragment, which binds to the column due to the presence of an amino-terminal His tag on the 
fragment. The isolated fragment is then proteolytically cleaved with thrombin to remove the 
histidine tag. These fragments have a four residue sequence G-S-H-M at the amino terminus, 
which does not effect the function of the fragments. After thrombin proteolysis, the fragment 

20 of interest are separated from the histidine tag, e.g., by size exclusion chromatography. 

ATPase assay 

ATPase assays can be performed to determine steady state kinetic parameters of HCV 
helicase using helicase fragments that contain at least subdomain I, such as by a continuous 
spectrophotometric assay [Pullman et al, J Biol Chem, 235: 3322-3329 (I960)]. Such an 

25 assay is also useful for comparing the ATPase activity of a fragment that is bound to or in a 
complex with an inhibitor compound with the activity of a full-length helicase protein or 
fragment not bound to an inhibitor. 

Initial ATPase rates can be measured at constant ATP concentrations without 
accumulation of ADP product (e.g., using a fragment based on subdomains 1,11 versus 

30 subdomain I or subdomain I,III). ATPase catalyzed adenosine diphosphate (ADP) formation is 
coupled to oxidation of NADH by the enzymes pyruvate kinase (PK) and lactate 



dehydrogenase (LD) and an excess concentration of the intermediate substrate 
phospho(enol)pyruvate (PEP). The assay permits ATPase rates to be monitored by the change 
in absorption at 340 nm. 

NMR sample preparation and NMR characterization 
5 Purified protein fragment samples, e.g. by gel filtration, can be concentrated to a 

desired concentration for NMR experiments (from about 50 to 1000 |aM helicase, preferably 
about 200 \M) plus about 5 to 15% D 2 0 (preferably about 10%), and conventional protease 
inhibitors, e.g., aprotinin, leupeptin, AEBSF [4-(2-Aminoethyl)-benzenesulfonyl fluoride], 
and Protease Inhibitor Cocktail I (Calbiochem, San Diego, CA), wherein the pH is pH 4 to 8.0, 

10 preferably pH 6 to 7. Alternatively, a buffer other than a gel filtration buffer can be used and 
exchanged using a desalting column, eg., 25 to 250 mM KP0 4 (preferably 75 mM), 25 to 250 
mM NaCl (preferably 50 mM), 1 to 10 mM DTT (preferably about 5 mM), 0.010 to 0.020% 
NaN3 (preferably 0.015%), wherein the pH is adjusted to about 6.5. The protein solutions are 
then transferred into NMR tubes for NMR studies. Two-dimensional 15 N-HSQC NMR spectra 

15 of the [ l5 N]-labeled HCV NS3 helicase fragments are acquired at 25°C to assess the folding 
and stability of the fragments. The number of peaks and their dispersion in the 2D 15 N-HSQC 
NMR spectra are indicative of fully folded proteins. The line widths of the peaks in the NMR 
spectra should be consistent with the molecular weight of the various HCV NS3 helicase 
fragments to indicate a fragment is monomelic under the conditions tested. 

20 A preferred buffer for use in NMR for fragments of the invention includes 50 to 1000 

(j,M of a helicase fragment, from 5 to 15% weight to volume of D2O, a protease inhibitor, 25 to 
250 mM KPO4, and 1 to 10 mM DTT, wherein the pH of the solution is from about 4 to 8. 
Additional components, including 25 to 50 mM NaCl (preferably aobut 50 mM) and 0.010 to 
0.02% NaN 3 (preferably about 0.015%), may be added to this buffer to enhance the unique 

25 properties of helicase fragments. 

NMR titration experiments to determine binding of adenosine triphosphate (ATP) 
Binding of adenosine triphosphate (ATP) to HCV NS3 helicase subdomain constructs 
can be determined using standard NMR titration experiments [Lian and Roberts, in NMR of 
Macromolecules, (Roberts, E.D., ed.) Oxford University Press, pp. 153-182 (1993)]. Such 

30 experiments are well suited for determining the interaction site of ligands with proteins and 
allow determination of dissociation constants of weak molecular interactions (Ka > 1 pM). 



Known amounts of ATP are added incrementally to NMR samples of [ 15 N]-labeled HCV NS3 
helicase subdomain constructs of known concentration. Two dimensional 15 N-HSQC NMR 
spectra are collected after each addition of ATP. The dissociation constant (Kd) of ATP is 
derived from an analysis of the changes in amide chemical shifts of residues in the binding site 
5 of the protein as a function of the concentration of ATP. 

NMR resonance assignments and secondary structure determination 
In the initial stage of any investigation by NMR spectroscopy, each nuclear magnetic 
resonance must be associated with a specific nucleus in the protein under investigation. 
Resonance assignments must be "sequence-specific", i.e., each resonance must be assigned to 

10 a spin in a particular amino acid residue in the protein sequence. NMR spectroscopy provides 
three types of information useful for spectral assignments: through-bond interactions (via 
scalar couplings), through-space interactions (via dipolar coupling), and chemical 
environment (via the chemical shift). The strategies employed for resonance assignments 
depend on the size of the protein under investigation and whether only homonuclear l H NMR 

1 5 spectra are available (unlabeled proteins) or whether C and N heteronuclear correlation 
spectra are available (isotopically labeled proteins). 

Conventional homonuclear multi-dimensional NMR techniques can be employed 
using unlabeled proteins to determine structures of proteins up to about 100 residues [e.g., 
Wuthrich, in NMR of Proteins and Nucleic Acids, Wiley, New York (1986); Wuthrich, Science^ 

20 243:45-50 (1989); Clore and Gronenborn, Ann Rev Biophys Chem^ 21 :29-63 (1991)] which are 
comparable in quality to 2-2.5 A resolution X-ray structures [Clore and Gronenborn, J Mol 
Biol± 221:47-53 (1991)]. However, for proteins larger than about 100 residues, such as 
helicase fragments of the present invention, conventional homonuclear assignment strategies 
can no longer be applied successfully and multi-dimensional heteronuclear NMR experiments 

25 must be employed using isotopically labeled proteins [e.g., Clore and Gronenborn, in NMR of 
Proteins, Clore and Gronenborn, eds., CRC Press, Boca Raton, pp 1-32 (1993)]. For the 
present invention a combination of standard double- and triple-resonance experiments to 
achieve NMR resonance assignments of isotopically labeled HCV NS3 helicase fragments 
[e.g., Markley and Kainosho, in NMR ofMacromolecules: A Practical Approach, Roberts, ed., 

30 IRL Press, Oxford, pp 101-152 (1993); Cavanagh et al, inProtein NMR Spectroscopy: 
Principles and Practice, Academic Press, San Diego, pp 410-556 (1996)] can be used. 
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Details of the local backbone geometry can be obtained by an extension of the 
sequential assignment process; the relative intensities of c1nn (NOE between amide protons), 
d a N (NOE between alpha proton and amide proton), and dpN (beta proton and amide proton) 
NOE cross-peaks and the measurement of the backbone 3 Jhnhci (intra-residue three-bond 
5 coupling constant between amide proton and alpha proton) are required. The combination of 
sequential NOE and 3 Jhnhoc coupling constant data together with medium range and a few long 
range NOEs is capable of providing details of the regions of regular secondary structure within 
the protein. Evidence of regular secondary structures can be corroborated by analysis of the 
amide exchange rates. The elements of secondary structures can be connected together to give 
10 a crude view of the global fold by the identification of a few key long-range NOEs. Thus, 

without recourse to extensive calculations and data analysis, important structural details (albeit 
of low absolute resolution) can be obtained in a straightforward manner [e.g., Barsukov and 
Lian, in NMR of Macromolecules: A Practical Approach, Roberts, ed., IRL Press, Oxford, pp 
315-357 (1993)]. 

15 In addition to the NOE, coupling constant, and amide exchange data, it has been well 

established in recent years to use information that is contained in the chemical shift data of the 
protein to derive its secondary structure. The nuclear chemical shift is very sensitive to its 
local electronic environment. Since the chemical shifts of the protein, especially those of l H a , 
13 C a , 13 C P , and 13 C nuclei, are correlated with its secondary structure, they can provide 

20 important information regarding the secondary structure of the protein [e.g., Spera and Bax, J 
Am Chem Soc, 113:5490-5492 (1991); Wishart et aL, Biochemistry, 31:1647-1651 (1992); 
Wishart and Sykes, Methods in Enzymology, 239:363-392 (1994); Cornilescu et al. 9 JBiomol 
NMR, 13:289-302 (1999)]. Among various empirical approaches to extract structural 
information from chemical shift data, the chemical shift index method has been widely 

25 accepted in the NMR community. In this approach a chemical shift index (CSI) is assigned to 
each residue of the protein based on a comparison between the chemical shifts of the l H a , 13 C a , 
13 C P , 13 C nuclei which are determined on the folded protein with those corresponding to 
random coil chemical shifts. The secondary structure elements can then be identified by 
examination of these chemical shift indices according to established rules [Wishart and Sykes, 

30 Methods in Enzymology, 239:363-392 (1994)]. Thus, NOE, coupling constant, amide 
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exchange data, and CSI data can be used to confirm the secondary structure elements and the 
global folds of HCV NS3 helicase fragments and full length protein. 

The fragments of the invention are ideal for use in NMR-based drug discovery 
techniques to discover, optimize, and synthesize chemical entities, including inhibitory 
5 compounds that are capable of binding to HCV NS3 helicase fragments or any protein thereof 
Assignments of the amide resonances of the target protein are an important step in these 
processes. This will allow determination of the location of the ligand-binding site(s) by 
analyzing the specific amide signals of the protein that change upon the addition of the 
compound. Thus, "hits" can immediately be judged inadequate if they are observed to disrupt 

10 the protein fold or bind to an undesired location. Having various subdomain HCV NS3 

helicase constructs, it is possible to obtain resonance assignments for the smaller constructs 
first and then correlate them with the larger multi-domain constructs. This approach greatly 
simplifies and accelerates the assignment process of the larger multi-domain HCV NS3 
helicase constructs. This is particularly true for constructs derived from HCV NS3 helicase 

15 subdomains I and II (e.g., subdomain I,IIA derived from amino acids 181-430,SDGK,452-481 
of HCV NS3) since the domain-domain interactions between domain I and II are very much 
localized and minimal. To obtain backbone assignments for proteins of molecular weight 
smaller than about 20 kDa is relatively easy and fast with current NMR methodologies. In 
contrast, this is still a challenge and a much slower process for larger polypeptides, such as 

20 fragments of HCV NS3 subdomain I,IIA (e.g., fragments derived from 
181-430,SDGK,452-481 of HCV NS3). 

Crystallization and X-ray Crystallographic Analyis 
Another aspect of the invention relates to preparation of crystals of HCV NS3 helicase 
fragments. Preferably, an HCV NS3 helicase fragment is produced recombinantly in E. coli 

25 and initial purification is accomplished by nickel chelate chromatography, as described supra. 
This HCV NS3 helicase subdomain preparation may be subjected to anion exchange 
chromatography for further purification. It may also be desirable to subject the HCV NS3 
helicase subdomain preparation to standard size exclusion gel filtration. The protein fragment 
preparation may be further concentrated using any desirable standard technique. Finally, the 

30 preparation can be ultracentrifugated to produce a monodisperse helicase fragment preparation. 
The resulting supernatant is useful for crystallization purposes. 
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To prepare the supernatant for crystallization, a stabilizing solution is added, which 
preferably contains a protein stabilizing agent, a salt, a buffering agent to adjust pH, and 
optionally a reducing agent or an oxygen scavenger is added. The protein stabilizing agent and 
salt maintain the solubility of the HCV NS3 helicase protein fragment preparation. Protein 
5 stabilizing agents, also known as cosmotropic agents, are well known in the art, and include 
polyols, sugars as well as amino acids and amino acid analogs, e.g., erythritol, sorbitol, 
glycerol, fructose, trehalose, proline, (3-alanine, taurine and glycine betaine [see Jeruzalmi & 
Steitz, JMol Biol± 274: 748-756 (1997)]. The concentration of a stabilizing agent will vary 
depending upon the type of agent employed. For example, glycerol is preferably provided in a 

10 concentration range from about 2 to about 20% (w/v), preferably about 10% (w/v). The salt 
may be provided in a concentration from about 0-2000 mM. Many salts are routinely used for 
this purpose. If desired, the reducing agent is present in the buffered solution at a 
concentration of about 10 mM. Examples of reducing agents include dithiothreitol (DTT) and 
dithioerythritol (DET), but it is preferably p-mercaptoethanol (BME). The final pH of the 

15 stablizing solution can range from 3.5 to 8, preferably between pH 5 and 6. 

A "precipitant" compound can be used to decrease the solubility of the polypeptide in a 
concentrated solution. Alternatively, a "precipitant" is a change in a physical or chemical 
parameter, including temperature, pH and salt concentrations, which decreases polypeptide 
solubility. Precipitants induce crystallization by forming an energetically unfavorable 

20 precipitant-depleted layer around the polypeptide molecules. To minimize the relative amount 
of this depletion layer, the polypeptides form associations and ultimately crystals [see Weber, 
Advances in Protein Chemistry - 4 41:1-36 (1991)]. Various precipitants are known in the art 
including, e.g., ammonium sulfate, ethanol, 2-methyl-2,4-pentanediol, and polyglycols. A 
suitable precipitant for crystallization of NS3/NS4A polypeptide complex is polyethylene 

25 glycol (PEG), which combines some of the characteristics of the salts and other organic 
precipitants. In addition to precipitants, other materials can be added to the polypeptide 
crystallization solution, including buffers to adjust the pH of the solution (and hence surface 
charge on the peptide) and salts to reduce the solubility of the polypeptide. 

Crystallization of NS3 helicase fragments of the invention can be accomplished using 

30 any of the various known methods in the art [see e.g., Giege et al., Acta Crystallogr^ D50: 
339-350 (1994); McPherson, EurJBiochem^ 189: 1-23 (1990)]. Such techniques include 
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microbatch, hanging drop, seeding and dialysis. Preferably, hanging-drop vapor diffusion 
[McPherson, J Biol Chem^ 251: 6300—6303 (1976)] or microbatch methods [Chayen, 
Structure^ 5: 1269-1274 (1997)] are used. In each of these methods, it is important to promote 
continued crystal growth after nucleation by maintaining a supersaturated solution. In the 
5 microbatch method, polypeptide is mixed with precipitants to achieve supersaturation, and the 
vessel is sealed and set aside until crystals appear. In the dialysis method, the polypeptide is 
retained in a sealed dialysis membrane which is placed into a solution containing precipitant. 
Equilibration across the membrane increases the precipitant concentration thereby causing the 
polypeptide to reach supersaturation levels. 

1 0 The following crystallization method, which was used to crystallize HCV NS3 helicase 

subdomain I (aa 181-324), can be used to crystallize an HCV helicase fragment. Preferably, 
the protein fragment concentration is at least 1 mg/mL and less than 60 mg/mL. 
Crystallization is achieved in a precipitant solution, which contains a precipitant compound, 
e.g., 2-methyl-2,4-pentanediol, having a concentration from about 5 to 35% (w/v). A protein 

15 stabilizing agent, e.g., 0.5 to 20% glycerol, may also be included as desired. A suitable salt, 
e.g., sodium chloride, can also be added as desired, preferably in concentration ranging from 1 
to 1000 mM. The pH of the precipitant is buffered to about 4.0 to 6.8, most preferably about 
pH 5 to 6. Specific buffers useful in a precipitant solution can vary and are well-known in the 
art e.g., MES, sodium cacodylate, sodium phosphate and sodium acetate [Scopes, Protein 

20 Purification: Principles and Practice, Third ed., Springer- Verlag, New York (1994)]. 

Crystals routinely grow in a wide range of temperatures, however it is preferred that crystals of 
the invention form at temperatures between about 1°C and 26°C, preferably between about 
2°C and 12°C, and most preferably at about 4°C. 

Crystals of the invention have a wide range of uses. For example, high quality crystals 

25 are suitable for X-ray or neutron diffraction analysis to determine the three dimensional 

structure of the corresponding subdomain of HCV NS3 helicase, and in particular to assist in 
the identification of active and effector sites for helicase. Knowledge of these sites and 
solvent accessible residues allow structure-based design and construction of agonists and 
antagonists for HCV NS3 helicase subdomain polypeptide complexes. In addition, 

30 crystallization can be used as a purification method. In some instances, a polypeptide or 

protein crystallizes from a heterogeneous mixture into crystals. Isolation of such crystals by 
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filtration and/or centrifugation, followed by redissolving the polypeptide affords a purified 
solution suitable for use in growing the high-quality crystals necessary for diffraction analysis. 
The crystallizable compositions of the invention can also be used for x-ray crystallography. 

Once a crystal of the present invention is grown, X-ray diffraction data can be collected. 
5 One method for determining structure uses synchrotron radiation, under standard cryogenic 
condition for such X-ray diffraction data collection. Other methods for characterizing crystals 
of the invention include x-rays produced in a conventional source, e.g., a sealed tube or a 
rotating anode, precession photography, oscillation photography and diffractometer data 
collection. 

10 The present invention permits the use of structure-based drug design techniques to 

design, select, and synthesize chemical entities, including inhibitory compounds that are 
capable of binding to HCV NS3 helicase subdomain polypeptide or any portion thereof. Also, 
de novo and iterative drug design methods can be used to develop drugs from the crystal 
structure of the present invention. One particularly useful drug design technique enabled by 

1 5 this invention is structure-based drug design, which optimizes associations between a protein 
and a compound by determining and evaluating the three-dimensional structures of successive 
sets of protein-compound complexes. HCV NS3 helicase fragment complexes suitable for 
crystallography analyses include, for example, a fragment of the invention in complex with a 
small-molecule, e.g., peptide, nucleotide, polynucleic acid (i.e. substrate), peptidomimetic 

20 nucleotide analog or an inhibitor unrelated in structure to substrate, members of the putative 
replicase complex (e.g., HCV NS5B, an RNA dependent RNA polymerase, NS2 or additional 
HCV proteins), one or more cellular host factors, and other molecules commonly used in such 
analyses, or combinations thereof. 

The association of a natural ligand or substrate with the binding pocket of a 

25 corresponding receptor or enzyme is the basis of many biological mechanisms of action. The 
term "binding pocket", as used herein, refers to any region of a molecule or molecular complex 
that favorably associates with another chemical entity or compound as a result of its shape. 
Similarly, drugs may exert their biological effects through association with the binding pocket 
of a receptor or enzyme. Such association may occur with all or any part of the binding 

30 pockets. An understanding of such association for HCV helicase will help to design drugs 
having more favorable associations with the target helicase enzyme, and thus, improved 
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biological effects. Therefore, this information is valuable in designing potential enzyme 
inhibitors against HCV NS3 helicase subdomain polypeptides complexes. 

In iterative structure-based drug design, crystals of a series of protein/compound 
complexes are used to solve the three-dimensional structure of each complex. Such an 
5 approach can provide insight into the association between a helicase protein and inhibitor 
compound by selecting compounds with inhibitory activity, obtaining crystals of the complex, 
solving the three-dimensional structure of the complex, and comparing the associations 
between the complex and previously solved protein. By observing how changes in the 
compound affected the protein/compound associations, an inhibitor compound can be 
10 optimized. 

Iterative structure-based drug design is carried out by forming successive 
protein-compound complexes followed by crystallizing each new complex, or by soaking {i.e., 
a process in which the crystal is transferred to a solution containing the compound of interest) 
a pre-formed protein crystal in the presence of a inhibitor, thereby forming a 

15 protein/compound complex and obviating the need to crystallize each individual 

protein/compound complex. It is an advantage that the HCV NS3 helicase fragment crystals of 
the invention can be soaked in the presence of one or more compounds, such as HCV NS3 
helicase subdomain inhibitors, substrates or other ligands, to provide HCV NS3 helicase 
fragment polypeptide compound crystal complexes. 

20 Structure coordinates of a helicase fragment can be used to determine the 

three-dimensional structure of HCV helicase, molecular complexes of HCV helicase, or 
molecules which contain a structurally similar feature to HCV NS3 helicase. Molecular 
replacement techniques can be used to obtain structural information about a crystallized 
molecule or molecular complex whose structure is unknown by obtaining an X-ray diffraction 

25 pattern from the crystallized molecule or molecular complex, and applying crystallographic 
phases derived from at least a portion of the structure coordinates derived from a helicase 
subdomain to the x-ray diffraction pattern to generate a three-dimensional electron density 
map of the molecule or molecular complex. In addition, the structure of an HCV NS3 helicase 
subdomain-compound complex can be determined from the structure coordinates of a 

30 fragment of the invention. For example, a helicase protein-compound complex can be 
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crystallized and the structure elucidated using methods such as difference Fourier or molecular 
replacement. 

All of the complexes referred to above can be studied using well-known X-ray 
diffraction techniques may be refined versus x-ray data to 3 A resolution or better to an Rf ree 
5 value of about 0.40 or less using computer software, e.g., X-PLOR [Yale University, 1992, 
distributed by Molecular Simulations, Inc.; see e.g., Blundell & Johnson, supra; Meth, 
Enzymol., vol. 1 14 & 1 15, Wyckoff et al, eds., Academic Press (1985)]. This information can 
be used to optimize known HCV NS3 helicase inhibitors, and to design new HCV NS3 
helicase inhibitors. 

10 The following Examples are provided to further demonstrate aspects of the invention, 

and are not intended to limit the invention thereto. 

EXAMPLES 

Example 1 

1 5 Construction, Expression and Purification of HCVNS3 helicase subdomain I 

pNS3(isi-324) was derived from plasmid pJC84 [ Grakoui et ai, J Virol 67:1385 1395 
09»£>Grakoui et aLJ Virol 67:1385-1395 (1993)1 which encodes the entire NS3 region of 
the la strain of HCV (SEQ ID NO: 1) . The gene encoding HCV NS3 helicase subdomain I 
{i.e., residues 181-324 of HCV NS3 helicase from HCV- la ; SEQ ID NO: 3 ) was PCR 

20 amplified from pJC84 using primers which incorporate a Ndel site at the 5' end of the gene and 
a Hindlll site at the 3' end. The PCR product was digested with the appropriate enzymes, gel 
purified and ligated into pet28b(+) (Novagen, Madison, WI), which was also prepared with 
Ndel and Hindlll. The ligation reaction was used to transform competent E. coli XL2-Blue 
(Stratagene, La Jolla, CA) which were selected on LB agar plates with kanamycin (30 |ag/ml). 

25 Recombinant clones were identified by PCR gene amplification and sequencing. The resulting 
plasmid, pNS3 ( i8i-324)> encodes a fusion protein of HCV NS3 helicase subdomain I (181-324) 
carboxy-terminal to a polyHis tag and thrombin cleavage site. 

A single colony from E. coli BL21(DE3) transformed with pNS3 ( i 8 i-324) was used to 
initiate growth in LB broth supplemented with 30 |ng/ml kanamycin. When the cell density 

30 reached an ODeoo of 1-2, the culture was used to inoculate M9 media [Lech and Brent, in 
Current Protocols in Molecular Biology, vol. 1, Ausubel et al. (eds), John Wiley and Sons, 
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New York, (1998)] supplemented with 30 (ig/ml kanamycin and 0.5 ml of 0.1 M thiamine. 
When the cell density reached an OD600 of 0.7-1.0 the cell culture was cooled to 16°C and 
recombinant protein expression was induced with IPTG (ImM final concentration). Cells 
were harvested 16 hours after induction and stored at -20 °C until lysed. 
5 The cell pellet was resuspended in 100 ml/L culture of lysis buffer containing BPER 

(Bacterial Protein Extraction Reagent; Pierce Chemical Company, IL), 300 mM NaCI, 0.2 mM 
DTT, 10% glycerol and 10 mM imidazole, pH 8.4, 5 ml/L protease inhibitor cocktail III 
(Pierce Chemical Company, IL) and 10,000 unit/L Benzonase. The suspension was 
homogenized using a glass homogenizer and incubated at room temperature for 20 minutes 

10 with gentle stirring. The lysate was cleared by centrifugation at 1 86,000 x g for 20 minutes. 
The supernatant was added to 4 ml/L culture of Ni resin which had previously been 
equilibrated in lysis buffer without DTT. The lysate and resin mixture was incubated for 1 
hour at 4°C on a rotator. After 1 hour, the resin was pelleted by centrifugation and resuspended 
with 10 ml of pre-chilled wash buffer consisting of 20 mM Tris-HCl, 25 mM imidazole, 0.2 

15 mM DTT, 500 mM NaCI, 0.1% BOG, and 10% glycerol, pH 8. The resin was pelleted by 
centrifugation and packed into a column. The resin was washed with additional wash buffer 
until the A, max 280 nm stabilized at a value close to zero. The bound recombinant protein was 
eluted with 250 mM imidazole, 1 mM DTT, 500 mM NaCI, and 10% glycerol, pH 8. 10 NIH 
units of thrombin were added per mg of fusion protein and the sample was dialyzed at 4°C for 

20 16 hours against 75 mM potassium phosphate, 1 mM DTT, 20% glycerol, pH 8. The sample 
was then dialyzed against gel filtration buffer (75mM potassium phosphate, 5mM DTT, 
0.01 5% sodium azide, pH 8) for 4 hours. After dialysis the sample was concentrated to 3 ml 
and applied to a Superdex SUPERB EX (gel fi ltration materialV200 size exclusion column (26 
x 60 cm, Amersham Pharmacia Biotech, NJ) equilibrated in gel filtration buffer containing 75 

25 mM potassium phosphate, 5 mM DTT, pH 8. Fractions containing HCV NS3 helicase 
subdomain I (181-324), as judged by sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis (SDS-PAGE), were pooled and concentrated for NMR to approximately 200 
jaM studies containing 75 mM potassium phosphate, 5 mM DTT, pH 8. This procedure 
yielded more than 16 mg of highly pure HCV NS3 helicase subdomain I (181-324) protein per 

30 liter of final E. coli growth culture. The protein was either stored at 4°C if used within a week 
or at ™20°C for long term strorage. 
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Example 2 

Construction, Expression and Purification of HCVNS3 helicase subdomain II A 

The helicase fragment of this example was prepared substantially as described in 
Example 1, except as otherwise noted. 
5 pNS3(327-430,sdgk,452-481) was derived from plasmid pJC84. The gene encoding HCV 

NS3 helicase subdomain IIA (327-430,SDGK,452-481; SEQ ID NO: 4), i.e., residues 327-481 
of helicase derived from HCV- la with residues 431-451 replaced by the amino acid sequence 
SDG K (SEP ID NO: 2\ was constructed from pJC84 in two pieces. The DNA sequence 
encoding residues 327-430 was amplified with a Ndel site in the upstream primer and the 

10 nucleotides encoding S-D-G-K (SEQ ID NO: 2) in the reverse primer. The DNA sequence 
encoding residues 452-481 was amplified with the nucleotides encoding for S-D-G-K (SEP ID 
NO: 2) in the forward primer and a Hindlll site in the reverse primer. The amplified DNA 
fragments were purified and then mixed for another round of PCR using the same forward 
primer used to amplify the DNA encoding residues 327-430 and the same reverse primer used 

1 5 to amplify the DNA encoding residues 452-481 . The resulting products were digested with 
Ndel and Hindlll, purified, and ligated into pet28b(+). The ligation reaction was used to 
transform competent E.coli XL2-Blue which were selected on LB agar plates with kanamycin 
(30 jag/ml). Recombinant clones were identified as described. The resulting plasmid, 

PNS3(327-43o,sdgk,452-481), encodes a fusion protein of HCV NS3 helicase subdomain IIA 

20 (327-430,SDGK,452-481) carboxy-terminal to a polyHis tag. 

The pNS3(327-43o,sdgk,452-481) was used to transform E. Coli BL21(DE3), which were 
grown as described above in Example 1 . Protein expression was induced, cells were harvested 
3 hours after induction and then stored at -20 °C. 

The cell pellet was resuspended in lysis buffer (pH 7.5) and homogenized. The lysate 

25 was cleared and the supernatant added to 1 ml of Ni resin as described. The lysate and resin 
mixture was incubated for 1 hour at 7°C on a rotator. After 1 hour the resin was pelleted by 
centrifugation and resupended with 10 ml of pre-chilled wash buffer (20 mM HEPES, pH 6.5, 
25 mM imidazole. 0.2 mM DTT, 500 mM NaCl, and 10% glycerol). The resin was pelleted by 
centrifugation, and washed on a column with additional wash buffer. Protein was eluted with 

30 250 mM imidazole, 20 mM HEPES, 1 mM DTT, 500 mM NaCl, and 10% glycerol, pH 6.5. 10 
NIH units of thrombin were added per mg of fusion protein and the sample was dialyzed at 7°C 
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for 16 hours against 20 mM HEPES, pH 6.5, 1 mM DTT, and 10% glycerol. The sample was 
dialyzed against gel filtration buffer consisting of 75 mM potassium phosphate, 5mM DTT, 
0.015% sodium azide, pH 6.5 for 4 hours. After dialysis the sample was concentrated as 
described. Fractions containing HCV NS3 helicase subdomain IIA (327-430,SDGK,452-481) 
as judged by SDS-PAGE were pooled and concentrated as described above. This procedure 
yielded approximately 5 mg of highly pure HCV NS3 helicase subdomain IIA 
(327-430,SDGK,452-481) protein per liter of final E. coli growth culture. The protein was 
stored as described above. 

Example 3 

Construction, Expression and Purification ofHCVNS3 helicase subdomain IJIA 

The helicase fragment of this Example was prepared substantially as described in 
Example 1 , except as otherwise noted. 

pNS3(i8i-43o f sDGK,452-48i) was derived from plasmid pJC84. The gene encoding HCV 
NS3 helicase subdomain IJIA (181-430,SDGK,452-481; SEQ ID NO: 5), i.e., residues 
181-481 of helicase derived from HCV- la with residues 431-451 replaced by amino acids 
SDG K (SEQ ID NO: 2) , was constructed from pJC84 in two pieces. The DNA sequence 
encoding residues 181-430 was amplified with a Ndel site in the upstream primer and the 
nucleotides encoding the amino acid sequence SDGK (SEP ID NO: 2) in the reverse primer. 
The DNA sequence encoding residues 452-481 was amplified with the nucleotides encoding 
S-D-G-K f SEP ID NO: 2) in the forward primer and a Hindlll site in the reverse primer. The 
amplified DNA fragments were subjected to another round of PCR using the forward primer 
forresidues 181-430 and the reverse primer for residues 452-481. The products were digested 
with Ndel and Hindlll, purified, and ligated into pet28b(+). The ligation reaction was used to 
transform competent E.coli XL2-Blue and recombinant clones were identified as described. 
The resulting plasmid, pNS3(i8i-43o,sDGK,452-48i)> encodes a fusion protein of HCV NS3 helicase 
subdomain I,IIA (181-430,SDGK,452-481) carboxy-terminal to a polyHis tag. 

The pNS3(i8i-43o,sDGic,452-48i) was used to transform E.coli BL21(DE3), which were 
grown as described. When cell density reached an DD600 of 1 .5, recombinant protein 
expression was induced, and cells were harvested after 3 hours and stored at -20°C. 

The cell pellet was resuspended in lysis buffer (pH 8), and incubated at room 
temperature for 20 minutes with gentle stirring. The lysate was cleared and the supernatant 
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added to 1 ml of Ni resin per 100 ml of lysate as described. The lysate and resin mixture was 
incubated for 1 hour at 7°C on a rotator. After 1 hour, the resin was pelleted by centrifugation 
and resupended with lOx resin volume of pre-chilled wash buffer (1% 
n-octyl-P-D-glucopyranoside, 50 mM potassium phosphate, 50 mM imidazole. 0.2 mM DTT, 
300 mM NaCl, and 10% glycerol, pH 8). The resin was pelleted and washed on a column with 
additional wash buffer. Protein was eluted with 1% n-octyl-P-D-glucopyranoside, 250 mM 
imidazole, 1 mM DTT, 300 mM NaCl, and 20% glycerol, pH 7. 10 NIH units of thrombin 
were added per mg of fusion protein and the sample was dialyzed at 7°C for 16 hours against 
50mM potassium phosphate, 1 mM DTT, 300 mM NaCl, and 20% glycerol, pH 7. The sample 
was dialyzed against gel filtration buffer consisting of 75 mM potassium phosphate, 50 mM 
NaCl, 5 mM DTT, 0.015% sodium azide, pH 6.5 for 4 hours. After dialysis the sample was 
concentrated as described. Fractions containing HCV NS3 helicase subdomain 1,11 A 
(181-430,SDGK,452-481) as judged by SDS-PAGE were pooled and concentrated as 
described. This procedure yielded approximately 9 mg of highly pure HCV NS3 helicase 
subdomain I,IIA (181-430,SDGK,452-481) protein per liter of final E. coli growth culture. 
The protein was stored as described. 

Example 4 

Construction, expression, and purification ofHCVNS3 helicase subdomain IJIIA 

The helicase fragment of this Example was prepared substantially as described in 
Example 1 , except as otherwise noted. 

pNS3(i8i_327,483-572) was derived from plasmid pJC84. The gene encoding HCV NS3 
helicase subdomain IJIIA (181-327,483-572; SEQ ID NO: 6), i.e., residues 181-572 of 
helicase from HCV-la with residues 328-482 deleted, was constructed from pJC84 in two 
pieces. The DNA sequence encoding residues 181-327 was amplified with a Ndel site in the 
upstream primer. The DNA sequence encoding for residues 483-572 was amplified with the 
nucleotides encoding a Hindlll site in the reverse primer. The amplified DNA fragments were 
purified and subjected to another round of PCR using the same forward and reverse primers. 
The products were digested with Ndel and Hindlll, purified, and ligated into pET28b(+). 
E.coli DH5a were transformed with the plasmid, and selected on LB agar plates with 
kanamycin (30 jag/ml). Recombinant clones identified as described. The resulting plasmid, 
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pNS3(i si -327,483-572), encodes a fusion protein of HCV NS3 helicase subdomain IJIIA 
(181-327,483-572) carboxy-terminal to apolyHis tag. 

The pNS3(i si -327,483-572) was used to transform E. coli BL21(DE3), which were grown 
as described. At OD600 1-2 the culture was used to inoculate an M9 culture for expression. 
5 When the cell density reached ODeoo 0.7-1 .0, the temperature was adjusted to 16 °C, and 
protein expression was induced. Cells were harvested 16 hours after induction and frozen at 
minus 20°C prior to purification. 

The cell pellet was resuspended in lysis buffer (pH 8.4) and homogenized. The lysate 
was cleared and the supernatant was added to 4 ml/L culture of Ni 2+ resin as described. The 

10 lysate and resin mixture was incubated for 1 hour at 4°C on a rotator. After 1 hour, the resin 
was pelleted and resuspended in 10 ml of pre-chilled wash buffer (20mM Tris-HCl, 25 mM 
imidazole, 0.2 mM DTT, 500 mM NaCl, 0.1% BOG and 10% glycerol, pH 8). The resin was 
pelleted, washed on a column with additional wash. Protein was eluted with 250 mM 
imidazole, 1 mM DTT, 500 mM NaCl, and 10% glycerol, pH 8. Ten NIH units thrombin were 

15 added per mg of fusion protein and the sample was dialyzed at 4°C for 16 hours against 75 mM 
potassium phosphate, 1 mM DTT, pH 8. After dialysis the sample was concentrated to 
approximately 1 5 mg/ml and applied to a Siiperd e x SUPERDEX feel filtration material V200 
size exclusion column (26 x 60 cm, Amersham Pharmacia Biotech, NJ) equilibrated in gel 
filtration buffer containing 75 mM potassium phosphate, 5 mM DTT, pH 8. Fractions 

20 containing HCV NS3 helicase subdomain IJIIA (181-327,483-572) as judged by SDS-PAGE 
were pooled and concentrated as described. This procedure yielded approximately 1-2 mg of 
highly pure HCV NS3 helicase subdomain IJIIA (181-327,483-572) protein per liter of final E. 
coli growth culture. The protein was stored as described. 

Example 5 

25 Construction, expression and purification of HCVNS3 helicase subdomain I mutant 

The helicase fragment of this example was prepared substantially as described in 
Example 1 , except as otherwise noted. 

pNS3(i8i-324,R257E) was derived from plasmid pNS3(igi-324) with an Arg-to-Glu point 
mutation at position 257, i.e., amino acid residues 181-324 of HCV NS3 helicase from 
30 HCV- la (SEQ ID NO: 3) contained a single mutation at Arg-257, which was replaced by a 
glutamic acid. The plasmid pNS3(i8i-324,R257E) was generated by a 
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Ouickchange OUIKCHANGE f site-directe d mutagenesis kit) PCR reaction (Stratagene, 
Cloning Systems, La Jolla, CA) using primers having the sequence 
ATCAGGACCGGGGTGGAAACAATTACCACTGGC (SEQ ID NO: 15) and 
GCCAGTGGTAATTGTTTCCACCCCGGTCCTGAT (SEQ ID NO: 16). The reaction 
mixture was used to transform competent E. coli XL2-Blue, which were selected on LB agar 
plates with kanamycin (30|ag/ml). Recombinant clones were identified as described. The 
resulting plasmid, pNS3(i8i-324,R257E> encodes a fusion protein of HCV NS3 helicase 
subdomain I (181-324) carboxy-terminal to a polyHis tag and thrombin cleavage site, having a 
mutation at amino acid residue 257 (Arg) to a Glu. 

The pNS3(i8i-324,R257E) was used to transform E. coli BL21(DE3), which were grown as 
described above in Example 1 . Protein expression and purification are essentially the same as 
described in Example 1 . 

Example 6 

Oligomerization states ofHCVNS3 helicase subdomain I and subdomain I mutant 

The oligomerization states of helicase fragments corresponding to HCV NS3 helicase 
subdomain I (181-324) and subdomain I (181-324,R257E) mutant were determined using size 
exclusion chromatography. 

The molecular weight determinations of HCV NS3 helicase subdomain I and the 
subdomain I (181-324,R257E) mutant were performed in 75 mM KP04, pH 7.6, 5 mM DTT, 
0.015% NaN3 using a Superdex SUPERDEX (gel filtration material)- 75 gel filtration column 
(Amersham Pharmacia Biotech, Piscataway, NJ) at 4°C. Protein absorbance was monitored at 
280 nm. Molecular weight standards, Aprotinin (6.5 kDa), Cytochrome c (12.4 kDa), 
Carbonic anhydrase (29 kDa), Bovine serum albumin (66 kDa), and Blue dextran (2,000 kDa) 
(Sigma Chemical Co. Saint Louis, MO) were used to construct a plot of log (molecular weight) 
versus elution volume [e.g., Boyer, in Modern Experimental Biochemistry, 
Benjamin/Cummings, California, (1993)]. The concentrations of the protein samples (50 jal) 
used in these experiments were: 1 .0, 1 .6, 3.2, 10.0 mg/ml for HCV NS3 helicase subdomain I 
and 0.4, 2.0, 6.0 mg/ml for subdomain I (181-324,R257E) mutant. 

The elution volume for subdomain I (181-324) decreased from 14.7 ml to 13.7 ml as 
the protein concentration was increased from 1.0 mg/ml to 10 mg/ml, corresponding to an 
apparent molecular weight increase from 17 to 22 kDa. This suggests that subdomain I 
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(181-324) undergoes oligomerization with increasing concentration, i.e. subdomain I does not 
remain monomelic at higher concentrations, such as those tested. The elution volume of the 
subdomain I (181-324,R257E) mutant remained constant (at 14.5 ml) in increasing 
concentrations of protein (from 0.4 mg/ml to 6 mg/ml). These results indicate that the 
5 subdomain 1(181 -324,R257E) mutant remains monomelic at high protein concentrations, and 
therefore the R257E mutation can improve the solubility of HCV NS3 helicase subdomain I 
(181-324). 



of HCV NS3 helicase and HCV NS3 helicase subdomain I,IIA (181-430,SDGK,452-481) in 

which residues 431-451 are replaced with S-D-G-K (SEP ID NO: 2) were compared using a 

coupled spectrophotometric assay as previously described. 

To determine the K m for ATP, helicase construct (40 nM) was assayed at 25° C in 
15 0.103 M sodium Mops buffer, pH 7.2, 2.6 mM MgCl 2 , 0.28 mg/ml BSA, 0.4 mM DTT, 0.1 

mM EDTA, 1 mM Tris-Cl, 1 mM sodium Hepes, 2 mM PEP, 20 U/ml LDH, 10 U/ml PK, 0.17 

mM NADH, ± 525 polyU ([U]), plus 0.05, 0.1, 0.2, 0.4, 0.8 1.6 or 3.2 mM Mg-ATP. 

To determine the constructs 1 relative steady state affinities for RNA, 20 nM of each was 

assayed as described above for K m , with the following modifications: [Mg-ATP] was 10 mM; 
20 [MgCb] was 5.1 mM; and [U] was between 0 and 660 |aM. Table 1 summarizes the ATPase 

activity parameters and results. 
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Example 7 

Comparision of ATPase activities ofHCVNS3 helicase and subdomain IJIA 

The K m values for ATP and the apparent steady state affinities for single stranded RNA 



TABLE 1 



NS3 Helicase Construct 



HCVNS3 helicase 181-631 



HCV NS3 helicase subdomain IJIA 



NA" Independent kcat 1 .6 sec" 

NA" Independent K m . ATP 0.005 rrM b 

NA" Stimulated kc at 36.7 sec" 1 
NA" Stimulated K m . ATP 0.22 mM 

Fold Stimulation by NA" 23 



5.5 sec" 
2.1 mM 
23.6 sec* 
2.7 mM 




4.3 
260 uM U 



"PolyU . 

^reugschat et alJBiol Chem, 271: 24449-24457 (1996). 
^Concentration of polyU ([U]) resulting in 50% maximal stimulation. 



Example 8 

25 NMR sample preparation, NMR spectrum ofHCVNS3 helicase subdomain I 



V 
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For NMR studies, HCV NS3 helicase subdomain I was adjusted to 100 to 150 |uM with 
the addition of 10% D 2 0, and 0.4 mM AEBSF [4-(2-Aminoethyl)-benzenesulfonyl fluoride, 
which is an irreversible serine protease inhibitor with high water solubility] (SIGMA, St. Louis, 
MO). The final buffer of the NMR sample contained 75 mM KP0 4 , pH 8.0, 5 mM DTT, 10% 
5 D 2 0, and 0.4 mM AEBSF. Two-dimensional (2D) 15 N-HSQC NMR spectra were obtained on 
a 600 MHz Varian NMR spectrometer at 25°C. Sweep widths of 8000 Hz for l H, centered on 
the water resonance, and 1824 Hz for 15 N, centered at 1 19 ppm, were used. The data were 
collected with 16 scans with 64 or 128 tl increment points in the 15N dimension. HCV NS3 
helicase subdomain I was already aggregated at a concentration of 500 |iM, as clearly 
10 indicated by the increased peak linewidths in an 15 N-HSQC NMR spectrum. In contrast, the 
peak line widths at 1 50 |uM were typical for a monomeric protein of this size. The number of 
peaks and their dispersion in the 2D 15 N-HSQC NMR spectrum were indicative of a fully 
folded protein. 

Example 9 

15 NMR sample preparation and spectrum of HCVNS3 helicase subdomain II A 

For NMR, studies HCV NS3 helicase subdomain IIA (327-430,SDGK,452-481), in 
which residues 431-451 are replaced with S-D-G-K (SEP ID NO: 2\ was adjusted to 100 to 
250 ]LiM as described in Example 8. The final buffer of the NMR sample contained 75 mM 
KP0 4 , pH 6.5, 5 mM DTT, 10% D 2 0 and 0.4 mM AEBSF. An HSQC spectrum was obtained 

20 as described. The number of peaks and dispersion in a 2D 15 N-HSQC NMR spectrum of HCV 
NS3 helicase subdomain IIA were indicative of a fully folded protein. In addition, the line 
widths of the peaks in the NMR spectrum were consistent with a monomeric protein with a 
molecular weight of 1 5 kDa. 

Example 10 

25 NMR sample preparation and spectrum of HCVNS3 helicase subdomain IJIA 

After isolating HCV NS3 helicase subdomain IJIA (1 81 -430,SDGK,452-48 1) in which 
residues 431-451 are replaced with SDGK (SEQ ID NO: 2; described in Example 3), the 
protein was concentrated in a centrifugal filtration device to approximately 220 \xM. 
Deuterium oxide was added to a final volume of 10%. The final sample was approximately 

30 200 |LiM HCV NS3 helicase subdomain IJIA, 75 mM KP0 4 , 50 mM NaCl, 5 mM DTT, 
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0.015% sodium azide, pH 6.5. The sample was placed in a 500 MHz NMR spectrometer and 
equilibrated at 25 °C. Data were collected with 32 scans for each of the 120 points in the 
indirect dimension and an HSQC spectrum was obtained. Sweep widths of 8000 Hz for ! H, 
centered on the water resonance, and 1833 Hz for 15 N, centered at 1 19 ppm, were used. The 
number of peaks and dispersion in a 2D 15 N-HSQC NMR spectrum of HCV NS3 helicase 
subdomain IJIA were indicative of a fully folded protein. In addition, the line widths of the 
peaks in the NMR spectrum were consistent with a monomelic protein with a molecular 
weight of 30 kDa. 

Example 1 1 

NMR sample preparation and NMR spectrum of HCV NS3 helicase subdomain I, III A 

For NMR studies, HCV NS3 helicase subdomain IJIIA (181-327,483-572) in which 
residues 328-482 are deleted, was adjusted to approximately 100 |^M, as described in Example 
8. The final buffer was the same as Example 9. An HSQC spectrum was obtained as described. 
The number of peaks and dispersion in a 2D 15 N-HSQC NMR spectrum of HCV NS3 helicase 
subdomain IJIIA were indicative of a fully folded protein. In addition, the line widths of the 
peaks in the NMR spectrum were consistent with a monomelic protein with a molecular 
weight of 24 kDa. 

Example 12 

ATP binding to HCVNS3 helicase subdomain IJIA 

NMR titration experiments were performed to determine if HCV NS3 helicase 
subdomain IJIA (181-430,SDGK,452-481) retained ATP binding affinity. The dissociation 
constant (K^) of ATP was derived from an analysis of the changes in amide chemical shifts of 
residues in the binding site of the protein as a function of ATP concentration. ATP (0.0002, 
0.001, 0.005, 0.025, 0.05 M) was added incrementally to 200 ^iM [ 15 N]-labeled HCV NS3 
helicase subdomain IJIA in 75mM potassium phosphate, 50mM NaCl, 5mM DTT, 0.015% 
sodium azide, pH 6.5. Two-D l5 N-HSQC spectra of HCV NS3 helicase subdomain IJIA 
(181-430,SDGK,452-481) were collected after each addition of ATP. A binding affinity of 
7.68±0.03 mM was determined using data obtained from analyzing chemical shift perturbation 
data as a function of ATP concentration. The data supports the binding of a nucleotide as 
expected for a protein with NTPase activity. 
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The following describes the analytical method used to calculate ATP binding. theThe 
dissociation constant of ATP was derived from the amide chemical shift changes of protein 
residues at the binding site as a function of the concentration of ATP. For an interaction of a 
compound C with a protein R: 

C+R ^ CR 
_ Ktr _ [C][R] 

" Kn [CR] 

This equation can directly be correlated to chemical shifts as follows: 

d ~ d f\ ([C] 0 +[*l + K d ) - V([C] 0 + [ R] 0 +K d f - 4[C] 0 [R] 0 

(1) 



8,-d h 2[/?] 0 



7 

where [C]o and [R]o are the total concentrations of compound and protein, respectively, and 
10 [CR] is the concentration of the complex, 5 is the chemical shift of the protein measured at 

each concentration [CR], 5f is the chemical shift of the protein in the absence of the compound 

[C]o - 0, and 5b is the chemical shift of the protein at saturation with compound. 

Nonlinear regression methods were used to estimate Ka and 8b in the titration 

experiment. Data from an experiment consist of chemical shift (8) values measured at a 
15 number of different compound concentrations. The values of [C]o, [R]o> and Sf are known. 

Estimates of IQ and 8b are computed by fitting the data to Equation (1), supra using nonlinear 

least squares in the statistical package S AS (Institute Inc, Cary, NC). From a nonlinear fit, 

estimates of the standard errors were obtained for K<j and 5b. 

Example 13 

20 Backbone NMR resonance assignments and secondary structure of HCVNS3 helicase 
subdomain IIA (327-430,SDGK,452-481) 

[ 15 N]- and [ 15 N/ 13 C]-labeled NMR samples of HCV NS3 helicase subdomain IIA 
(327-430,SDGK,452-48 1 ; SEQ ID NO: 4) in which residues 43 1 -45 1 are replaced with SDGK 
(SEQ ID NO: 2) were prepared to obtain sequential resonance assignments. The protein 

25 concentration was about 0.6 mM in a buffer system containing 75 mM KP04, pH 6.5, 5 mM 
DTT, 5% D 2 0, 0.4 mM AEBSF, and 0.015% NaN 3 . 15 N-HSQC, 3D 15 N-edited 
NOESY-HSQC and 15 N-edited TOCSY-HSQC NMR spectra were acquired using a uniformly 
[ 15 N]-labeled sample. 3D triple resonance experiments, such as HNCO, HNCACB, 
CBCA(CO)NH, and (H)C(CO)NH-(TOCS Y) were acquired using a uniformly 
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[ N/ C]-labeled sample. The sample for the hydrogen-deuterium exchange experiments was 
prepared by dissolving a lyopholized protein sample into 99.99% D2O at a concentration of 0.5 
mM in 75 mM KiP0 4 , pH 6.5, 5 mM DTT, 0.4 mM AEBSF and 0.015% NaN 3 . The protein 
sample was immediately placed in the NMR spectrometer, and a series 1J N-'H HSQC spectra 
5 were collected over time. All NMR experiments were performed on a Varian INOVA 500 
MHz spectrometer at 25°C. NMR data were processed using FELIX98 (MSI, San Diego) and 
analyzed using NMRView on a SGI workstation. 

Sequential assignments of *H N , 15 N, 13 C a and 13 C P were derived from the 3D HNCACB 
and CBCA(CO)NH NMR triple-resonance experiments by analyzing the sequential 
10 connectivities of 13 C a and l3 C p chemical shifts. 13 C chemical shifts were obtained from a 3D 
HNCO NMR experiment. Jhnho. coupling constants were obtained from analysis of a 3D 
HNHA data set. 

131 of the expected 132 non- proline backbone amide 15 N/'H N NMR resonances have 
been sequence-specifically assigned (see Table 2, infra). In addition, backbone C, C' a and 

1 5 13 C P (for non-glycine residues) NMR resonances have been assigned. 3 Jhnh(x coupling 

constants were obtained for 80 out of the 132 non-proline residues. 51 amide proton signals 
were detected during the hydrogen-deuterium exchange experiments. Preliminary analysis of 
15 N-edited NOE spectra indicates that the overall fold of this isolated domain is similar to the 
corresponding part in the crystal structure of full-length HCV NS3 helicase. 

20 The chemical shift index (CSI) method was used to predict the secondary structure of 

HCV NS3 helicase subdomain IIA (327-430,SDGK,452-481) in which residues 431-451 are 
replaced with S-D-G-Kt (SEP ID NO: 2Y The chemical shift index was calculated using the 
CSI program (Wishart and Sykes, 1994, software provided by Wishart) using 1 H ct , 13 C, 13 C' a 
and 13 C P chemical shifts to determine well-defined regions of P -sheet and a-helical secondary 

25 structure (Fig. 3). The CSI indicates that residues 336-339, 353-359, 363-367, 387-391, 

406-41 1, 424-427, and 471-477 are in the p-sheet conformation, whereas residues 371-381 and 
455-462 are in a a-helical conformation of SEQ ID NO: 1 . This secondary structure prediction 
is in very good agreement with the secondary structure elements that are observed for the 
corresponding part in the crystal structure of full-length HCV NS3 helicase [Yao et ah 9 Nat 

30 Struct Biol, 4:463-467 (1997)]. Although there are some differences between the starting and 
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ending residues in the secondary structure elements predicted by the CSI when compared to 
those of the crystal structure, all differences are within the accuracy of the CSI method. There 
are however two regions of secondary structure that are not predicted by the CSI; in the crystal 
structure of full-length HCV NS3 helicase residues 347-349 and 356-359 of SEQ ID NO: 1 are 
in a P-sheet and a-helical conformation, respectively. Nevertheless, NOE, coupling constant, 
and amide exchange data for these residues are consistent with the secondary structure of the 
crystal structure. Strong da N (i,i+i) NOEs were observed for residues 347-349 with large 3 Jhnh<x 
coupling constants (7.6 Hz and 7.5 Hz for residue 347 and 349, respectively) which is 
consistent with residues 347-349 adopting a p-sheet conformation. Strong d N N(u+i) NOEs were 
observed for residues 356-359, and residue 356 showed d a N(i,i+2) and d a N(i,i+3)NOEs. In addition, 
small 3 JHNHa coupling constants of 2.2 Hz and 2.1 Hz were detected for residues 356 and 357, 
respectively. Moreover, the hydrogen-deuterium exchange experiments revealed that residue 
359 is highly protected from the solvent solution suggesting that its amide proton is hydrogen 
bonded. These data suggest that residues 356-359 form a P-turn conformation like in the 
crystal structure. 

The following Table 2 contains the backbone NMR resonance assignments of HCV 

NS3 helicase subdomain IIA (327-430,SDGK,452-48 1 ; SEQ ID NO: 4) in which residues 

43 1-45 1 are replaced with amino acids SDGK (SEQ ID NO: 2). The table contains one line for 

each residue. From left to right, the columns indicate residue number, 3-letter amino acid code, 

chemical shift of 1 H N , chemical shift of 15 N, chemical shift of 13 C a , chemical shift of 13 C P , and 

i ^ 

chemical shift of C (n.a., not available; n.d., not determined). 

TABLE 2 



resi_num 


AA 


HN 


15N 


CA 


CB 


c 


327 


GLY 


8.42 


111.17 


45.12 


n.a. 


173.42 


328 


SER 


8.21 


116.62 


57.75 


63.30 


172.89 


329 


VAL 


8.17 


122.36 


61.66 


32.35 


171.20 


330 


THR 


8.23 


119.53 


61.30 


68.60 


173.36 


331 


VAL 


8.19 


125.27 


59.07 


32.11 


n.d. 


332 


PRO 


n.a 


n.d. 


62.91 


31.69 


171.25 


333 


HIS 


8.62 


121.70 


52.78 


29.53 


n.d. 


334 


PRO 


n.a 


n.d. 


62.91 


31.58 


170.91 


335 


ASN 


9.01 


120.94 


52.84 


39.01 


173.36 


336 


ILE 


7.80 


120.24 


59.89 


41.06 


172.30 


337 


GLU 


8.17 


129.72 


54.71 


30.94 


172.61 
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338 


GLU 


8 91 


127 02 

1 1 * w x_ 


54 71 


30 11 

V*/ \y all 


170 81 


339 
\j \j <j 


VAL 


9 41 


126 64 


60 01 


34 82 


173 64 


340 


ALA 


8 46 


132 53 


51 18 


18 22 


167 88 


341 

0*T 1 


LEU 


8 22 


123 16 

1 ^ J . IV 


54 84 


42 71 


170 58 


342 


SER 


9 08 


120 52 


55 19 


64 96 


173 82 


343 


THR 


8 01 


106 69 


61 66 


67 56 


173 00 

1 f J ■ V/V/ 


344 


THR 

X X XXV 


8 12 


1 1Q 72 


61 54 


67 60 


174 77 

1 f *T • f f 


345 


GLY 

\— * X-l X 


7 QQ 


1 13 72 


43 41 


n a 

1 1 >C1. 


175 15 


34R 
o*+u 


GLU 


8 Qfi 


121 04 


59 66 


29 88 

.\J\J 


170 68 

1 f u>uu 


347 


ILE 


7 7fl 
/ . f u 


1 18 Q7 

1 1 O.J f 


56 13 


38 24 


n r! 

1 1 .u. 


348 

OtO 


PRO 

X7 


n 9 
1 1 .a. 


n H 
1 1 .u . 


62 Q1 


31 69 

O 1 .UC7 


1 70 56 

1 f U.JU 


34Q 


PHE 

XT X 1JJ 


8 4Q 


195 67 

1 Z. J.U I 


58 24 


41 05 


1 74 54 

1 / *T. J*T 


350 

O JU 


TYR 


8 36 


194 39 


59 60 


35 56 


172 48 


351 
O J 1 


GLY 


8 21 


105 55 


44 71 


1 I.Cl. 


173 05 

1 f O.UJ 


352 


LYS 


8 01 

U.U 1 


122 25 


52 13 


32 1 1 


172 38 

1 f c. JU 


353 


ALA 


8 47 
u - ■ f 


125 53 


51 07 


21 05 


172 56 

1 I LiJU 


354 


ILE 

<X. XJXJ 


9 09 


121 24 


57 18 


41 30 


n ri 

1 1 .u . 


355 

vJ J 


PRO 


1 1 .CI - 


n d 

1 I.U . 


60 25 

\J\J - £-\J 


31 41 


170 01 

I i u iVy 1 


356 


LEU 


9 19 


128 23 


56 83 


41 41 

1 - I 1 


168 70 

1 UU • 1 \J 


357 


GLU 


8 77 


1 13 Q1 


58 60 


28 94 


169 09 


358 


VAL 

V X 


7 03 


108 58 

1 \J\J , JU 


61 31 


30 23 


170 92 


35Q 


ILE 

■X* X~JX_J 


7 11 
f . i i 


1 1 1 Q6 


59 54 


38 47 

OU.T f 


173 72 
i f j . f *- 


350 


LYS 

X_l X> 


6 93 


122 62 


57 07 


31 76 


171 79 


361 


GLY 

\JXJ X. 


7 78 
/ . / u 


1 14 24 


43 8Q 


1 I.Cl. 


173 54 

1 1 O.J*t 


3R2 


GLY 


8 24 


1 1 9 47 

1 1 C .*+ f 


43 8fi 


n a 

1 I.Cl. 


174 44 


3R3 


ARG 


8 5Q 

U. JC7 


121 41 


54 94 


31 2Q 

O 1 


179 43 

1 / t.td 


3R4 


HIS 


8 QO 


124 02 


54 3R 


35 05 
oo.uo 


1 75 69 

1 r J.U^ 


3R5 


T.FTT 


ft 05 

O.U J 


19R ftfi 


S9 OA 


43 30 


174 33 

1 f *T.OO 


3fifi 
ouu 




Q 35 

U.O J 


19R Q9 


07 
oy.u f 


40 19 

HU. 1 ^ 


172 53 

1 f C .JO 


3R7 

OU f 


PHE 


Q 19 


194 R5 


5R Q5 


41 78 


170 RO 
1 f u.uu 


3R8 


PYS 


8 75 


1 1R 8R 

1 ID.OD 


5R Q5 


31 53 

O 1 .JO 


171 RR 
1 f 1 .uu 


3RQ 

OUJ7 




8 45 
o.*+ j 


1 1 7 7R 


5R Q5 


31 9Q 


1 73 RQ 

1 f O .UC7 


370 


SER 


R 81 

U.O l 


1 1 9 85 


54 98 

0*T.^O 


66 99 


173 81 

1 f O.U 1 


371 

O f 1 


LYS 

XJ X k-J 


8 78 
O. f o 


194 QD 


5Q 14 

OC7. 1 *+ 


31 04 

O I .U*T 


1 70 5R 

1 / U.JU 


372 

O f £- 


LYS 

XJ X 


7 87 


1 1Q 90 


58 79 

JO. 1 c 


31 88 

O 1 .OU 


1R8 1Q 

1 UU. 1 C7 


373 

J f J 


LYS 

XJ X w 


7 53 

f .JO 


1 1Q 1Q 

1 1 C7. 1 C7 


56 79 


30 82 


168 71 

1 UU • i 1 


374 

J f " 


CYS 


7 68 
f .uu 


1 18 49 


R9 48 


26 46 


173 02 

1 / J . \J £- 


375 


ASP 

ilk-? XT 


8 13 


1 1 Q 79 

I I \3. f ^ 


56 Q5 

OU.C70 


39 64 


167 8fi 

1 U f .UU 


376 

u f u 


GLU 


8 13 

U> 1 o 


121 70 


58 60 

OU .\J\J 


29 88 


168 06 

1 UU .UU 


377 

Off 


LETT 


8 76 


199 99 


57 30 

O r .OU 


40 83 


1R8 57 

1 UU. J f 


378 


ALA 


8.72 


120.99 


55.66 


16.46 


168.96 


379 


ALA 


7.53 


117.84 


54.60 


17.17 


180.22 


380 


LYS 


7.77 


121.24 


58.36 


31.88 


168.84 


381 


LEU 


8.21 


118.64 


57.42 


40.47 


167.83 


382 


VAL 


8.59 


122.91 


65.66 


31.29 


180.14 


383 


ALA 


7.71 


124.07 


54.13 


17.29 


168.45 



384 


LEU 


7 42 


118 63 


54 12 


42 12 


170 48 


385 

V V Vy 


GLY 


8 03 

vy ■ v/ vy 


108 26 

1 V/ V/ « at* V/ 


45 06 


n a 


173 38 

1 * vy • vy vy 


386 

V^r V/ 


ILE 


7 91 


123 56 

1 aW VV • vu 


57 54 


37 17 


171 89 


387 

vy Vy P 


ASN 


8 72 

W.I b 


126 25 
i ^ w . ^ w 


52 13 

^/X> t 1 w 


36 95 


174 46 

1 1 r • r vy 


388 

vy vy vy 


ALA 


7 16 


128 26 


49 30 


22 94 


1 70 76 

1 1 v/ « 1 v/ 


389 

vy vy w 


VAL 


8 72 

W.I £— 


1 19 64 


59 30 


35 53 


174 56 

1 # r * vy vy 


390 

kJ \J \J 


ALA 


8 33 

W • WW 


128 47 
i ^w . 1 1 


49 41 


20 11 


171 12 


391 


TYR 


8 66 

W • KJ KJ 


1 18 73 


59 07 


41 53 

1 1 - vy W 


176 18 

1 f w. 1 w 


392 

W \J L— 


TYR 


5 38 


120 76 

1 ^ KJ . f W 


52 95 


39 08 

w vy - w w 


173 79 

1 1 w • ■ w 


393 

WW W 


ARG 


8 17 


1 18 36 

1 1 %J .WW 


57 89 


29 40 

£_ vy ilU 


171 27 

III ■ f 


394 


GLY 


8 81 


1 15 23 


44 00 




172 76 

1 / m 1 vy 


395 


LEU 


7 53 


121 19 


53 06 


42 36 

" L— -WW 


171 32 

III ■ W £— 


396 


ASP 


8 61 

O.W 1 


123 75 


53 19 


42 83 

" . W vy 


170 37 

1 f w . w / 


397 

W vy I 


VAL 


a 4Q 


123 93 


63 90 

\J\J • c w 


31 53 

W 1 . vy vj 


170 43 

1 1 W ■ t W 


398 

WWW 


SER 


8 68 

w. ww 


1 18 83 


59 78 

iyj> 1 u 


62 25 


172 10 

1 I ^ • 1 v/ 


399 


VAL 


7 82 

f . W L— 


1 18 25 


62 95 

KJ . w W 


31 40 

W 1 « T W 


171 30 

II 1 ■ W \J 


400 

r W W 


ILE 


7 53 


122 09 


58 60 

vy W .UU 


37 76 


n d 

II* VjI • 


401 

T W 1 


PRO 


n a 

I i<o> 


n d 


61 81 

vy 1 - vy 1 


31 43 


168 40 

1 vy vy ■ r v/ 


402 

■ KJ £— 


THR 


8 78 


1 18 50 


61 89 

w 1 • w w 


68 58 


173 90 

1 ■ vy * vy 


403 


ASN 


7 88 

I . W KJ 


1 18 02 


51 30 


41 41 


172 59 

1 1 b • w w 


404 

~ v/~ 


GLY 


8 48 

W . i W 


109 16 


43 65 

1 vy - vy vy 


n a 


174 85 

1 I »^ > vy vy 


405 


ASP 


8 19 

W . 1 C 


119 41 


54 83 

vy 1 . vy W 


40 35 

1 W 1 kJ W 


170 56 

1 f W . WW 


406 

i KJ w 


VAL 


8 30 


121 41 


60 83 


33 06 

WW. WW 


176 46 

1 1 W . 1 w 


407 


VAL 


8 37 

W.W / 


126 86 

1 >uvy 


60 60 

W vy • vy W 


33 18 

WW. 1 w 


172 63 

1 i • vy vy 


408 

T W W 


VAL 

V X * * * 


9 21 

W . 1 


130 04 


60 10 

uu . 1 w 


31 45 

w 1 w 


173 87 

1 f W • W 1 


409 

"TV W 


VAL 


ft 99 

O > w w 


130 38 


60 36 

w W . vJ vj 


31 64 

\J 1 . w" 


172 90 


410 

■t I W 


ALA 


9 39 

W . W W 


130 48 


50 48 

\J\J .iKJ 


26 00 


169 94 

1 W vy . W^ 


41 1 


THR 




10fi 87 

1 UU.U f 


5Q 07 


70 14 


178 57 


412 


ASP 

XT 


ft ft1 

w.w 1 


1 18 no 


58 on 




170 20 

1 / W t£J\J 


413 

" 1 W 


AT. A 

nun 


8 15 


124 18 


53 42 


18 46 


168 58 

1 WW . WW 


414 


LEU 


ft 10 


1 18 9Q 


56 60 


42 12 


173 41 

1 / W . " 1 


415 


MET 


8 03 


116 14 


56 40 

v^u ,*tU 


31 03 


170 26 

1 I w *^w 


416 

T 1 W 


THR 

X^ X 11^ 


7 67 


1 10 80 


62 48 

KJ C .*tU 


68 62 

UU . KJ c 


171 87 

1 1 1 - w 1 


417 


GLY 


8 06 
w . w w 


110 11 


45 26 

" vy . ^ \J 




174 41 
1 1 ~ • 1 1 


418 

~ I vy 


PHE 

X> X 1U 


7 98 

/ • W W 


121 40 

1 £- 1 . 1 \J 


56 83 

v'U .WO 


41 18 

T 1 . IU 


171 87 

if 1 * \J 1 


419 

T 1 W 


THR 

X X XXV 


8 13 

W . 1 W 


109 56 


61 68 


68 72 


173 61 

1 1 w • w 1 


420 


GLY 

VJ XW nX 


6 35 

W . W vj 


109 86 


44 71 




176 16 

1 f w - 1 w 


421 


ASP 


7 95 

f • v7 W 


1 19 34 


52 71 


42 70 


171 89 

III .WW 


422 


PHE 

X> X JIJ_I 


8 60 


1 18 07 

1 1 KJ . \J 1 


58 01 

vy W . W 1 


42 47 


171 63 

III .WW 


423 


ASP 

IlL^ XT 


9 51 


125 51 


56 83 


40 82 

" w . w t 


170 94 

1 1 W • W™ 


424 


SER 


7.78 


1 12.40 


57.42 


65.90 


,4^1— -7-7 

175.77 


425 


VAL 


8.71 


120.58 


59.79 


37.17 


173.50 


426 


ILE 


9.55 


127.82 


60.01 


38.24 


173.64 


427 


ASP 


8.85 


128.10 


52.60 


44.47 


172.55 


428 


CYS 


7.23 


121.09 


57.73 


27.70 


173.33 


429 


ASN 


9.15 


115.98 


54.69 


37.71 


174.18 
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430 


THR 


7.35 


110.13 


60.34 


69.60 


174.74 


s 


SER 


8.71 


119.64 


56.62 


63.09 


173.15 


D 


ASP 


8.95 


125.46 


54.37 


39.25 


171.61 


G 


GLY 


8.15 


107.44 


45.01 


n.a. 


173.66 


K 


LYS 


7.58 


121.42 


53.45 


32.16 


n.d. 


452 


PRO 


n.a. 


n.d. 


62.80 


31.37 


170.76 


453 

■ \^ x^ 


GLN 


8.24 


124.32 


55.10 


30.10 


171.74 


454 

* x^ * 


ASP 


8 36 


127 08 

■ MmM 1 • XX ^X 


52 01 

XX ^» » XX I 


41.31 


171.58 


455 

■ x^ 


ALA 


8.39 


121.98 


54.58 


18.07 


180.99 


456 

■ X^ X^ 


VAL 


7 66 

■ ■ XX XX 


120 05 

■ ^mm XX * xx vx 


65.49 

XX XX * I XX 


30.82 


169.22 


457 


SER 


7.98 


117.30 


60.24 


61.68 


170.92 


458 


ARG 


7.94 


121.36 


59.90 


30.07 


169.84 


459 


THR 


7.84 


113 66 

■ ■ xx ■ xx xx 


65.90 


67.78 


170.22 


460 

r xx x^ 


GLN 


8 12 

XX ■ ■ 


122 78 

■ ^b^h * 9 XiX 


58 13 

X,X V • • VX 


27.52 


168.50 

I XX XX • XX XX 


461 


ARG 


8 20 

XX • XX 


120 90 

1 ^ XX ■ XX XV 


60 65 

XX XX • VX xx 7 


30.07 


169.68 


462 


ARG 


8.04 


118.88 


58.71 


29.86 


171.69 


463 


GLY 


7.32 


129.17 


45.06 


n.a. 


172.41 


464 


ARG 


7.44 


118.56 


54.30 


28.89 


173.12 


465 


THR 


7.78 


114.86 


59.17 


70.46 


174.82 


466 


GLY 


8.38 


110.78 


45.65 


0.00 


172.05 


467 


ARG 


8.51 


121.48 


55.80 


28.57 


170.63 


468 


GLY 


8.54 


112.37 


45.06 


n.a. 


174.78 


469 


LYS 


7.58 


119.95 


53.53 


31.29 


n.d. 


470 


PRO 


n.a. 


n.d. 


63.34 


31.48 


170.09 


471 


GLY 


8.75 


111.00 


44.47 


n.a. 


174.46 


472 


ILE 


7.26 


121 16 


60 01 

XX XV • XX ■ 


41.81 


174.77 


473 


TYR 


9 23 

XX ■ ^— Xb* 


130 81 

1 vx xx » vx ■ 


54 95 

X^ 1 • XX x«^ 


42 00 

■ ■ XX XX 


174.09 


474 


ARG 


9 52 


126 87 

■ XX • VX ■ 


52 79 

V4« ■ * XX 


30 94 

VX XX • VX ¥ 


172 28 

■ 9 m^m • 9mm VX 


475 

I w X^ 


PHE 


6 98 

XX^ m XX X»^ 


115 72 

■ ■ VX • f 


54 00 

Vx T * XX VX 


40 23 


0 00 


A 7C 

4/0 


T7AT 


o.uy 


A OH OA 

Izl .oil 


bo. 44 


OO 07 

oZ.of 


1/1 .yo 


477 


ALA 


8.98 


128.74 


48.95 


19.88 


n.d. 


478 


PRO 


n.a. 


n.d. 


62.27 


31.90 


170.71 


479 


GLY 


8.28 


108.97 


44.09 


n.a. 


173.96 


480 


GLU 


8.24 


121.72 


56.12 


30.11 


171.81 


481 


ARG 


7.97 


127.50 


56.71 


31.17 


n.d. 



Example 14 

Crystallization ofHCVNS3 helicase subdomainl 

E, coli derived HCV NS3 helicase subdomain I (i.e.. amino acids 181-324 : SEP ID 
NQf- of HCV NS 3) was expressed and purified as described. Purified HCV NS3 helicase 
subdomain I (60 mg total) was dialyzed against a 75 mM Tris, pH 8.0, 100 mM sodium 
chloride, 5 mM dithiothreitol solution and concentrated by centrifugal filtration to 0.12 mM 
(16 mg/ml) followed by ultracentrifugation prior to crystallization. Vapor diffusion 
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crystallization experiments were conducted using the hanging drop method. Crystals suitable 
for structure determination were grown from a droplet containing 2 jal of protein: 2 (il of the 
reservoir solution (100 mM MES, pH 5.4, 20% 2-methyl-2,4-pentanediol (MPD), 5 mM 
p-dithiothreitol). Crystals were incubated on rectangular crystallization plates ( 0.01 x 0.05 x 
0.1mm) at 4°C over 1-4 weeks. 



Crystallization ofHCVNS3 helicase subdomain I by microseeding 

Vapor diffusion crystallization experiments were conducted as described in Example 
12, except the hanging drop method was supplemented by micro-seeding. Crystals suitable for 
structure determination were grown from a droplet as described. The droplet was 
micro-seeded with a HCV NS3 helicase subdomain I crystal at 22°C. Crystallization plates 
were incubated at 4°C on rectangular plates (0.02x0.10x0.2 mm) and grown over 1 -4 weeks. 



Crystallographic analysis of HCVNS3 helicase subdomain I 

Prior to data collection, crystals were either taken directly from the crystallization 
droplet in crystal storage solution and by either addition of 20% glycerol or increasing MPD 
concentration to 20% were flash frozen using either nitrogen gas stream or liquid propane. A 
complete diffraction data set from a HCV NS3 helicase subdomain I (181-324) crystal was 
achieved from a synchrotron radiation facility in IMCA beamline, APS, Chicago, USA. 

Crystals belong to the primitive monoclinic space group P2j . The unit cell dimensions 
are a=34.8 A, b=67.1 A, c=58.4 A, oc=90°, P=101.3°, y=90° with two molecules in the 
asymmetric unit. Most crystals diffract beyond 1.9 A. Table 3 shows the data collection 
statistics. 



Example 15 



Example 16 



TABLE 3 



Resolution 

No. of collected reflections 

No. of unique reflections (F >= 0) 

R-sym 

Percent of theoretical (I/s >= 1) 
Unit Cell 
Space Group 
Asymmetric unit 



40-1.9 A 

608721 

19772 

0.068 

93.9% 

a=34.8 A, b=67.1 A, c=58.4 A 
P2i 

2 molecules 
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Model Building and Refinement 

HCV NS3 helicase subdomain I (181-324) structure was determined by molecular 
replacement methods as coded in XPLOR. The 2Fo-Fc map showed the C-termini of helicase 
in the active site of protease. The structure was further refined using simulated-annealing, and 
5 positional and B-factor refinement (XPLOR 3.1), while gradually extending the resolution. 
Both search models were derived from the HCV strain la, and the appropriate changes 
corresponding to the IB strain of the HCV NS3_helicase subdomain I (181-324) were made 
after the resolution or refinement was beyond 1 .9 A. The Rf ree [Brunger, Meth Enzy> 
276:558-580 (1997)] was closely monitored throughout the refinement. Table 4 shows the 
10 refinement data statistics of the HCV NS3 helicase subdomain I (181-324). 

TABLE 4 

Parameter Value 
Rfree 1073 unique reflections (40.0 to 1 .9 A res.) 0.40 
R- factor of 1 8626 unique reflections 0.32 
Rms deviation from ideal bond distances (A) 0.006 
Rms deviation from ideal angle (°) 1.59 
Protein heavy atoms 2054 



Table 5 contains one line for each atom in one HCV NS3 helicase NTPase domain 
monomer (SEP ID NO: 17. Le„ residues 181-324 from HCV NS3 helicase subdomain I) . 
From left to right, the columns indicate residue number, 1 -letter amino acid code, atom name, 
15 x-coordinate (A) multiplied by 10, y-coordinate (A) multiplied by 10, z-coordinate (A) 

multiplied by 10, and B-factor. The coordinates of the second monomer (x 2 , y 2 , Z2) are related 
to the coordinates of the first monomer (xj, yi, z\) listed below according to the following 
operation: 

x 2 = xran + yra ]2 + z\ -ai 3 + tj; 
20 y 2 = xr a 2 i + yr a 22 + z\ *a 23 + 1 2 ; 

z 2 = xra 3 i + yra 32 + z\ -a 33 + 1 3 , where 
an a !2 a u - 0.9252 0.0230 0.3787; 
a 2 i a 22 a 23 = 0.0210 -0.9997 -0.0095; 
a 3 i a 32 a 33 = 0.3788 0.0008 -0.9255; and 
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t, t 2 t 3 = -15.68 31.56 79.47 (expressed in A). 

TABLE 5 
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Based on the structural data set forth in Table 5, one having ordinary skill in the art can 
determine the crystalline structure of a crystal from subdomain I of HCV helicase protein. 



* m\» ^» 
*T* *T* *T* *T* 



The descriptions of the foregoing embodiments of the invention have been presented 
for purpose of illustration and description. They are not intended to be exhaustive or to limit 
the invention to the precise forms disclosed, and obviously many modifications and variations 
are possible in light of the above teaching. The embodiments were chosen and described in 
order to best explain the principles of the invention to thereby enable others skilled in the art to 
utilize the invention in various embodiments and with various modifications as are suited to 
the particular use contemplated. It is intended that the scope of the invention be defined by the 
claims appended hereto. 
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WHAT IS CLAIMED IS: 

1 . A polypeptide fragment of an HCV helicase protein, which is derived from a 
subdomain of the HCV NS3 helicase protein, wherein the fragment is less than 30 kDa, 
structurally sound, soluble, monodisperse, and stable in a buffered solution. 

2. A polypeptide fragment of claim 1, which is suitable for use in nuclear magnetic 
resonance. 

3. A polypeptide fragment of claim 1, wherein the subdomain comprises amino acids 181 
to 324 of the HCV helicase protein. 

4. A polypeptide fragment of claim 3, which comprises a single amino acid substitution, 
the amino acid selected from Asp 249 and Arg 257, wherein the substitution is a nonpolar 
amino acid. 

5. A polypeptide fragment of claim 4, wherein the substitution for Asp 249 is lysine or 
arginine, and wherein the substitution for Arg 257 is glutamic acid or aspartic acid. 

6. A polypeptide fragment of claim 1 , wherein the subdomain comprises amino acids 327 
to 481 of the HCV helicase protein, and wherein the amino acid residues at positions 431 to 
451 are deleted and replaced by an amino acid sequence selected from SEQ ID NOS: 2, 7, 8, 9, 
10, 11, 12, 13, and 14. 

7. A polypeptide fragment of claim 1, wherein the subdomain comprises amino acids 181 
to 481 of the HCV helicase protein, and wherein the amino acid residues at positions 431 to 
45 1 are deleted and replaced by an amino acid sequence selected from SEQ ID NOS: 2, 7, 8, 9, 
10, 11, 12, 13, and 14. 
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8. A polypeptide fragment of claim 1 , wherein the subdomain comprises amino acids 181 
to 572 of the HCV helicase protein, and wherein the amino acid residues at positions 328 
through 482 are deleted. 

5 9. A polypeptide which comprises an amino acid sequence selected from SEQ ID NO: 3, 
SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6, wherein the polypeptide comprising SEQ ID 
NO:3, SEQ ID NO:5 or SEQ ID NO:6 optionally comprises a single amino acid substitution 
selected from Asp 249 and Arg 257, wherein the substitution is a nonpolar amino acid. 

10 10. A polypeptide of claim 1, which is in a complex with a small molecule HCV helicase 
inhibitor. 

11. A crystalline composition, comprising a polypeptide fragment derived from subdomain 
I. 

15 

12. The crystalline composition of claim 11, wherein the polypeptide fragment further 
comprises a single amino acid substitution, the amino acid selected from Asp 249 and Arg 257, 
wherein the substitution is a nonpolar amino acid. 

20 13. The crystalline composition of claim 12, wherein the substitution for Asp 249 is lysine 
or arginine, and wherein the substitution for Arg 257 is glutamic acid or aspartic acid. 

14. The crystalline composition of claim 12, wherein the polypeptide fragment comprises 
an amino acid sequence of SEQ ID NO: 3. 

25 

15. A crystalline composition comprising the structural coordinates set forth in Table 5. 

16. A precipitant solution comprising from 1 to 60 jxg of an HCV helicase fragment, from 5 
to 35% weight to volume of a precipitant compound, from 1 to 1000 mM of a salt, a buffer for 

30 a precipitant solution and optionally a protein stabilizing agent, wherein the pH of the solution 
is from about 4 to 7 and the temperature is from about 1 to 26°C. 
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17. A buffered solution comprising from 50 to 1000 of an HCV helicase fragment, 
from 5 to 15% weight to volume of D2O, a protease inhibitor, 25 to 250 mM KPO4, optionally 
about 25 to 250 mM sodium chloride, optionally about 0.010 to 0.020% sodium azide and 1 to 

5 10 mM DTT, wherein the pH of the solution is from about 4 to 8. 

18. A method for preparing a purified crystalline composition which comprises a purified 
polypeptide fragment of claim 1, comprising the steps of: 

stabilizing the purified polypeptide fragment in a solution comprising a protein 
10 stabilizing agent; 

purifying the polypeptide from the solution by applying the solution to an anion 
exchange chromatography column and collecting the fraction containing the polypeptide; and 
crystallizing the polypeptide in the fraction by allowing crystals to form a precipitant 
which comprises a protein stabilizing agent, a salt and a precipitant compound under 
1 5 conditions which permit crystallization. 

19. The method of claim 18, wherein the polypeptide fragment comprises an amino acid 
sequence of SEQ ED NO: 3. 

20 20. A method for identifying an inhibitor compound of an HCV helicase protein, 
comprising the steps of: 

obtaining a helicase polypeptide fragment which is contacted with a compound, 
wherein the fragment comprises at least subdomain I or subdomain II of the HCV NS3 helicase 
protein; 

25 assaying the fragment which is contacted with the compound and assaying the HCV 

helicase protein for activity; 

and comparing the activity of the fragment in contact with the compound to the activity 
of the HCV helicase protein, wherein a decrease in the activity of the fragment compared with 
the HCV helicase protein identifies the compound as an inhibitor of HCV helicase activity. 



30 
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ABSTRACT 

This invention provides fragments of HCV NS3 helicase, and crystalline compositions 
thereof, based on subdomains of HCV helicase protein. The protein fragments are stable, 
5 soluble, and structurally sound. They can be expressed at high levels in conventional 

expressions systems, such as E. coli, to permit efficient, large-scale production for NMR-based 
screening applications and production of [ H, C, N]- and [ C, N]-labeled polypeptides for 
structural NMR studies. Helicase fragments of the present invention are useful in the most 
advanced NMR techniques available, e.g., NMR-based drug discovery techniques such as 
10 SAR-by-NMR, in biological assays to discover inhibitors of HCV NS3 helicase, and to 
evaluate the mechanism of action and substrates for HCV NS3 helicase. Crystals of the 
present invention are useful for structure-based drug design studies using x-ray 
crystallographic techniques. 



